=> s raffinose synthase 

L1 8 RAFFINOSE SYNTHASE

=> s transgenic

L2 52105 TRANSGENIC

=> s l1 and l2

L3 1 L1 AND L2

=> d l3



L3 ANSWER 1 OF 1 CAPLUS COPYRIGHT 1998 ACS 
AN 1998:217308 CAPLUS 
DN 128:318794 

TI Cloning of cDNA for raffinose synthase from cucumber, and its use for
   preparing raffinose or transgenic plants low in raffinose-type
   oligosaccharides
IN Oosumi, Chieko; Nozaki, Shinji; Kida, Takao
PA Ajinomoto Co., Inc., Japan
SO Jpn. Kokai Tokkyo Koho, 26 pp.
   CODEN: JKXXAF
PI JP 10084973 A2 980407 Heisei
AI JP 97-111124 970428
PRAI JP 96-107682 960426
     JP 96-198079 960726
DT Patent 
LA Japanese 

=> d l1 1-8

L1 ANSWER 1 OF 8 BIOSIS COPYRIGHT 1998 BIOSIS
AN 98:339293 BIOSIS
DN 01339293
TI Characterization and gene cloning of raffinose synthase from Cucumis
   sativus.
AU Ohsumi C; Nozaki J; Kida T
CS Ajinomoto Co. Inc., CRL., Kawasaki 210, Japan
SO 1998 Annual Meeting of the Japanese Society of Plant Pathologists,
   Tokyo, Japan, May 3-5, 1998. Plant and Cell Physiology 39 (SUPPL.)
   1998. S131. ISSN: 0032-0781
DT Conference
LA English

L1 ANSWER 2 OF 8 BIOSIS COPYRIGHT 1998 BIOSIS
AN 98:339292 BIOSIS
DN 01339292
TI Purification of raffinose synthase from Cucumis sativus leaves.
AU Nozaki J; Ohsumi C; Kida T
CS Ajinomoto Co. Inc., CRL., Kawasaki 210, Japan
SO 1998 Annual Meeting of the Japanese Society of Plant Pathologists,
   Tokyo, Japan, May 3-5, 1998. Plant and Cell Physiology 39 (SUPPL.)
   1998. S131. ISSN: 0032-0781
DT Conference
LA English



L1 ANSWER 3 OF 8 BIOSIS COPYRIGHT 1998 BIOSIS
AN 90:197835 BIOSIS
DN BA89:104506
TI RAFFINOSE SYNTHASE AND GALACTINOL SYNTHASE IN DEVELOPING SEEDS AND
   LEAVES OF LEGUMES.
AU CASTILLO E M; DE LUMEN B O; REYES P S; DE LUMEN H Z
CS DEP NUTRITIONAL SCI., UNIV. CALIFORNIA, BERKELEY, CALIF. 94720.
SO J AGRIC FOOD CHEM 38 (2). 1990. 351-355. CODEN: JAFCAU ISSN: 0021-8561
LA English

L1 ANSWER 4 OF 8 AGRICOLA
AN 91:43724 AGRICOLA
DN IND91018838
TI Raffinose synthase and galactinol synthase in developing seeds and
   leaves of legumes.
AU Castillo, E.M.; Lumen, B.O. de; Reyes, P.S.; Lumen, H.Z. de
CS University of The Philippines, Los Banos, Los Banos, The Philippines
AV DNAL (381 J8223)
SO Journal of agricultural and food chemistry, Feb 1990. Vol. 38, No. 2.
   p. 351-355
   Publisher: Washington, D.C. : American Chemical Society.
   CODEN: JAFCAU; ISSN: 0021-8561
NTE Includes references.
DT Article
FS U.S. Imprints not USDA, Experiment or Extension
LA English

L1 ANSWER 5 OF 8 CAPLUS COPYRIGHT 1998 ACS
AN 1998:217308 CAPLUS
DN 128:318794
TI Cloning of cDNA for raffinose synthase from cucumber, and its use for
   preparing raffinose or transgenic plants low in raffinose-type
   oligosaccharides
IN Oosumi, Chieko; Nozaki, Shinji; Kida, Takao
PA Ajinomoto Co., Inc., Japan
SO Jpn. Kokai Tokkyo Koho, 26 pp.
   CODEN: JKXXAF
PI JP 10084973 A2 980407 Heisei
AI JP 97-111124 970428
PRAI JP 96-107682 960426
     JP 96-198079 960726
DT Patent
LA Japanese

L1 ANSWER 6 OF 8 CAPLUS COPYRIGHT 1998 ACS
AN 1992:55624 CAPLUS
DN 116:55624
TI Distribution and immunolocalization of stachyose synthase in Cucumis
   melo L.
AU Holthaus, Uta; Schmitz, Klaus
CS Bot. Inst., Univ. Koeln, Cologne, W-5000/41, Germany
SO Planta (1991), 185(4), 479-86
   CODEN: PLANAB; ISSN: 0032-0935
DT Journal
LA English

L1 ANSWER 7 OF 8 CAPLUS COPYRIGHT 1998 ACS
AN 1990:115882 CAPLUS
DN 112:115882
TI Raffinose synthase and galactinol synthase in developing seeds and
   leaves of legumes
AU Castillo, Eugenia M.; De Lumen, Benito O.; Reyes, Pilar S.; De Lumen,
   Helen Z.
CS Dep. Nutr. Sci., Univ. California, Berkeley, CA, 94720, USA
SO J. Agric. Food Chem. (1990), 38(2), 351-5
   CODEN: JAFCAU; ISSN: 0021-8561
DT Journal
LA English
OS CJACS

L1 ANSWER 8 OF 8 WPIDS COPYRIGHT 1998 DERWENT INFORMATION LTD
AN 98-264858 [24] WPIDS
DNN N98-208774 DNC C98-082361
TI Raffinose synthase gene - useful for preparation of raffinose in
   transformed plant.
DC C06 D16 P13
PA (AJIN) AJINOMOTO KK
CYC 1
PI JP 10084973 A 980407 (9824)* 26 pp C12N015-09
ADT JP 10084973 A JP 97-111124 970428
PRAI JP 96-198079 960726; JP 96-107682 960426
IC ICM C12N015-09
   ICS A01H005-00; C12N009-00

=> s raffinose 

L4 6845 RAFFINOSE

=> s synthase

L5 89205 SYNTHASE

=> s l4 and l5

L6 107 L4 AND L5

=> s transgenic

L7 52105 TRANSGENIC

=> s l6 and l7

L8 3 L6 AND L7
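
The L-number arithmetic in this session (for example L8 = L6 AND L7) behaves like set intersection over record identifiers. A minimal Python sketch of that idea follows, assuming each answer set is held as a set of accession strings. The `combine_and` helper and the miniature `answer_sets` are hypothetical illustrations, not part of STN, and the identifier `1997:000001` is a made-up placeholder.

```python
from typing import Dict, Set


def combine_and(answer_sets: Dict[str, Set[str]], left: str, right: str) -> Set[str]:
    """Return the records present in both named answer sets (Boolean AND)."""
    return answer_sets[left] & answer_sets[right]


# Hypothetical miniature answer sets; the real L6 and L7 hold 107 and 52105 records.
answer_sets = {
    "L6": {"1998:217308", "1993:553371", "93-058793", "1990:115882"},
    "L7": {"1998:217308", "1993:553371", "93-058793", "1997:000001"},
}

# Mirrors "=> s l6 and l7": only records found in both sets survive.
answer_sets["L8"] = combine_and(answer_sets, "L6", "L7")
print(sorted(answer_sets["L8"]))
```

Because AND can only shrink a set, the hit count of a combined set never exceeds the smaller of its inputs, which is consistent with the counts reported in the session (107 and 52105 giving 3).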

=> d l8 1-3



L8 ANSWER 1 OF 3 CAPLUS COPYRIGHT 1998 ACS
AN 1998:217308 CAPLUS
DN 128:318794
TI Cloning of cDNA for raffinose synthase from cucumber, and its use for
   preparing raffinose or transgenic plants low in raffinose-type
   oligosaccharides
IN Oosumi, Chieko; Nozaki, Shinji; Kida, Takao
PA Ajinomoto Co., Inc., Japan
SO Jpn. Kokai Tokkyo Koho, 26 pp.
   CODEN: JKXXAF
PI JP 10084973 A2 980407 Heisei
AI JP 97-111124 970428
PRAI JP 96-107682 960426
     JP 96-198079 960726
DT Patent
LA Japanese



L8 ANSWER 2 OF 3 CAPLUS COPYRIGHT 1998 ACS
AN 1993:553371 CAPLUS
DN 119:153371
TI Nucleotide sequences for galactinol synthase from zucchini and soybean
IN Kerr, Phillip S.; Pearlstein, Richard W.; Schweiger, Bruce J.;
   Becker-Manley, Mary F.; Pierce, John W.
PA du Pont de Nemours, E. I., and Co., USA
SO PCT Int. Appl., 79 pp.
   CODEN: PIXXD2
PI WO 9302196 A1 930204
DS W: AU, BB, BG, BR, CA, CS, FI, HU, JP, KP, KR, LK, MG, MN, MW, NO,
      PL, RO, RU, SD, US
   RW: AT, BE, BF, BJ, CF, CG, CH, CI, CM, DE, DK, ES, FR, GA, GB, GR,
       IT, LU, MC, ML, MR, NL, SE, SN, TD, TG
AI WO 92-US6057 920724
PRAI US 91-735066 910724
DT Patent
LA English

L8 ANSWER 3 OF 3 WPIDS COPYRIGHT 1998 DERWENT INFORMATION LTD
AN 93-058793 [07] WPIDS
DNN N93-044772 DNC C93-026307
TI Nucleotide sequence of galactinol synthase from zucchini and soybean -
   used to produce plants having altered levels of raffinose
   saccharide(s) and/or sucrose.
DC C06 D16 P13
IN BECKER-MANLEY, M F; KERR, P S; PEARLSTEIN, R W; PIERCE, J W;
   SCHWEIGER, B J
PA (DUPO) DU PONT DE NEMOURS & CO E I
CYC 37
PI WO 9302196 A1 930204 (9307)* EN 80 pp C12N015-54
      RW: AT BE CH DE DK ES FR GB GR IT LU MC NL OA SE
      W: AU BB BG BR CA CS FI HU JP KP KR LK MG MN MW NO PL RO RU SD
         US
   AU 9224205 A 930223 (9324) C12N015-54
   ZA 9205592 A 940330 (9417) 76 pp C07H000-00
   EP 604458 A1 940706 (9426) EN C12N015-54
      R: FR
   US 5648210 A 970715 (9734) 24 pp C12Q001-68
   US 5773699 A 980630 (9833) A01H001-04
ADT WO 9302196 A1 WO 92-US6057 920724; AU 9224205 A AU 92-24205 920724;
    ZA 9205592 A ZA 92-5592 920724; EP 604458 A1 EP 92-917170 920724, WO
    92-US6057 920724; US 5648210 A WO 92-US6057 920724, US 94-182060
    940823; US 5773699 A Cont of US 91-735066 910724, Cont of WO
    92-US6057 920724, Cont of US 94-182060 940823, US 96-712702 960912
FDT AU 9224205 A Based on WO 9302196; EP 604458 A1 Based on WO 9302196;
    US 5648210 A Based on WO 9302196; US 5773699 A Cont of US 5648210
PRAI US 91-735066 910724; US 94-182060 940823; US 96-712702 960912
IC ICM A01H001-04; C07H000-00; C12N015-54; C12Q001-68
   ICS A01H005-00; C07H021-02; C07H021-04; C12N001-21; C12N005-10;
       C12N015-11; C12N015-29; C12N015-82; C12P019-18; C12P019-34

=> d l8 2-3 ab

L8 ANSWER 2 OF 3 CAPLUS COPYRIGHT 1998 ACS
AB The cDNAs for zucchini leaf and soybean seed galactinol synthase (I)
   are cloned and expressed in a transgenic plant or a host cell using an
   appropriate regulatory sequence. The plant or plant cell transformed
   with the cDNA operatively linked to a promoter, e.g., the 35S
   promoter, may have a varied level of I and thus the desirable levels
   of raffinose saccharides and sucrose. Nucleotide fragments derived
   from the cDNA can be used for RFLP breeding of altered levels of
   raffinose saccharides and sucrose traits in soybeans. I may also be
   prepd. in a host, e.g., Escherichia coli, and used for prepg.
   galactinol.

L8 ANSWER 3 OF 3 WPIDS COPYRIGHT 1998 DERWENT INFORMATION LTD
AB WO 9302196 A UPAB: 931119
   An isolated nucleic acid fragment (I) comprising a nucleotide
   sequence encoding plant galactinol synthase (GaS) is new.

   Also claimed are: (1) a chimeric gene (II) capable of being
   expressed in transformed plants, comprising (I) operably linked to a
   suitable regulatory sequence; (2) a chimeric gene (III) capable of
   being expressed in transformed plants, comprising a nucleic acid
   fragment encoding antisense RNA complementary to plant GaS; (3) a
   chimeric gene (IV) capable of being expressed in transformed
   microorganisms, comprising (I) operably linked to a suitable
   regulatory sequence; (4) plants transformed with (II) and (III)
   respectively; (5) seeds obtd. from the plants of (4); (6) a
   microorganism transformed with (IV); (7) methods for obtaining
   plants and plant cells contg. altered levels of raffinose
   saccharides and/or sucrose; (8) 2 methods for producing galactinol;
   (9) a method of RFLP breeding of altered levels of raffinose
   saccharides and sucrose traits in soybeans; (10) a method of varying
   the level of D-galactose-contg. oligosaccharides of sucrose in
   plants in response to end-user requirements.

   USE / ADVANTAGE - Transgenic plants that produce higher than normal
   levels of raffinose saccharides possess enhanced cold tolerance and
   in coniferous species will result in reduced post harvest needle
   abscission. Transgenic plants that produce lower than normal levels
   of raffinose saccharide are more easily digestible and have larger
   amts. of metabolisable energy. In sugar beets in particular, a
   decrease in raffinose saccharide content would improve sucrose
   crystallisation and overcome the need for expensive processing of
   sugar beet extracts. The enzyme can be used for prodn. of
   galactinol, and the nucleic acid fragments can be used as RFLP
   markers in soybean genetic studies and breeding programs.
   Dwg. 0/3
IMP2, a nuclear gene controlling the mitochondrial dependence of galactose, maltose
and raffinose utilization in Saccharomyces cerevisiae.

Donnini C, Lodi T, Ferrero I, Puglisi PP

Yeast 1992 Feb;8(2):83-93

Abstract

The IMP2 gene of Saccharomyces cerevisiae is involved in the nucleo-mitochondrial control of maltose,
galactose and raffinose utilization as shown by the inability of imp2 mutants to grow on these carbon
sources in respiratory-deficient conditions or in the presence of ethidium bromide and erythromycin. The
negative phenotype cannot be scored in the presence of inhibitors of respiration and oxidative
phosphorylation, indicating that the role of the mitochondria in the utilization of the above-mentioned
carbon sources in imp2 mutants is not at the energetical level. Mutations in the IMP2 gene also confer
many phenotypic alterations in respiratory-sufficient conditions, e.g. leaky phenotype on oxidizable
carbon sources, sensitivity to heat shock and sporulation deficiency. The IMP2 gene has been cloned,
sequenced and disrupted. The phenotype of null imp2 mutants is indistinguishable from that of the
originally isolated mutant.

MeSH

Amino Acid Sequence ; Base Sequence ; Carbohydrates ; Galactose ; Genes, Fungal ; Genetic
Complementation Test ; Maltose ; Mitochondria ; Molecular Sequence Data ; Mutation ; Phenotype ;
Plasmids ; Raffinose ; Saccharomyces cerevisiae ; Support, Non-U.S. Gov't
PubMed nucleotide query

LOCUS       SCIMP2       2071 bp    DNA             PLN       18-AUG-1993
DEFINITION  S. cerevisiae IMP2 gene.
ACCESSION   X61928 S94332
NID         g3829
KEYWORDS    IMP2 gene.
SOURCE      baker's yeast.
  ORGANISM  Saccharomyces cerevisiae
            Eukaryotae; mitochondrial eukaryotes; Fungi; Ascomycota;
            Hemiascomycetes; Saccharomycetales; Saccharomycetaceae;
            Saccharomyces.
REFERENCE   1  (bases 1 to 2071)
  AUTHORS   Lodi,T.
  TITLE     Direct Submission
  JOURNAL   Submitted (04-SEP-1991) T. Lodi, Inst di Genetica, Viale dele
            Scienze, I-43100 Parma, ITALY
REFERENCE   2  (bases 1 to 2071)
  AUTHORS   Donnini,C., Lodi,T., Ferrero,I. and Puglisi,P.P.
  TITLE     IMP2, a nuclear gene controlling the mitochondrial dependence of
            galactose, maltose and raffinose utilization in Saccharomyces
            cerevisiae
  JOURNAL   Yeast 8 (2), 83-93 (1992)
  MEDLINE   92221693
FEATURES             Location/Qualifiers
     source          1..2071
                     /organism="Saccharomyces cerevisiae"
                     /db_xref="taxon:4932"
                     /chromosome="IX"
     gene            697..1635
                     /gene="IMP2"
     CDS             697..1635
                     /gene="IMP2"
                     /codon_start=1
                     /db_xref="PID:g3830"
                     /db_xref="SWISS-PROT:P32351"
                     /translation="MERGHRERGRSKKKRGERDSNVSSLSRSRSRASSRSRVREEEFL
                     KWTVLRQDPSMRLRVVDVDSEEEGEGNDEDDDDGDGDDMDEEESDEEQVSDIENDLEI
                     DEEFHYDLGMKVLPNFCTSINEVLDSSKPWIAKYEISIRGHENEGVSLEQLDGGYVRA
                     MQLLTKGAGAEAGNQRSFILYTDLSSESTYALTYLMGAAVNQGDTVYIVHWEPSKPTD
                     DSQMFANVARIRKHVMHLFDCVAGVLHDLHVVVLSLTHPYPKHLLNEMIHGLKPVALC
                     CSLSVILSTLQNFVCSVPILAVRKKLKRAKRKGISE"



BASE COUNT      565 a    534 c    534 g    438 t
ORIGIN
        1 ggtcgacgaa ccaaagctgt tgcccaggta atactgcaat tcaggcacca cctcatggta
       61 ttccgaggga agcatttcct gcagccactg aggcaaacgt tcatccagct tgtgatgcca
      121 ctcgcggcat gctaggttac cataccttct aggaccagga agaggcggtg tctcgtcgat
      181 caggtgtgca agcttgtcca acactcccat cagcccgtta accatggcga tcgacgacgc
      241 agtagcgtgc gggtcggggt cagaacagtg cgatatcaac gtagtgtact tgtgaagatg
      301 atacttgatt ctgtgaatgg ctagtgacga ttggaagtcc agcgttgtct gcgtatcgaa
      361 gatacgtttc accggcgtag agaaggtggc gtgcggccaa tctacacgat ccagagacat
      421 ttctttgctc gtttaatgct tgttcatatg tgcgttcttt tgcactttgc actttcttcc
      481 tattttatct ttttaagaaa gctttaaagg tccggaaaaa aggcttgatc aaacacaaga
      541 gactattgaa agggtgagta ccaaaagaac caacaagaga aacaaccaag tacgcaatgc
      601 agaagagcat attgctgact aaactgacgg aaaacaatcg aatctacaca gcatcaaaac
      661 ggaaacgcca ccacggtgga gttcgactcg gagcagatgg aaaggggcca cagggaaaga
      721 ggtcgtagca agaagaaaag aggcgaacga gactcaaacg tgagcagtct gtcgcggtcg
      781 agaagcaggg ccagtagccg cagcagagta agggaggaag agttcctcaa gtggaccgtg
      841 ttgaggcagg acccctcgat gcggttgagg gtcgtggatg tggattctga agaggaaggt
      901 gagggcaacg acgaagatga cgacgacggc gacggcgacg atatggacga ggaagagtcc
      961 gatgaagagc aagtgagcga tatagagaac gatttagaga ttgacgagga gttccactac
     1021 gatctgggca tgaaagtgtt acccaacttt tgtaccagca taaatgaagt gctagactcc
     1081 agcaagccct ggatagccaa gtacgagatc agcatccgtg gccacgaaaa cgaaggcgtg
     1141 tctctggagc aactcgacgg aggctacgtc agagccatgc aactactcac caagggtgcc
     1201 ggcgcagagg cggggaacca aaggtccttc atcctctaca cggacctgag cagcgagtcc
     1261 acctacgccc tgacctatct catgggcgca gctgtcaacc agggagacac cgtctacatt
     1321 gtccactggg agccctcgaa gcccacggac gactcccaga tgttcgccaa cgttgccaga
     1381 atcagaaagc acgtcatgca cctgtttgac tgcgtcgcgg gcgtgctgca cgacctgcac
     1441 gtcgtcgtcc tctccttgac ccatccgtac ccaaaacacc tcctcaacga gatgatccac
     1501 ggcctcaagc cagtcgccct gtgctgctcc ctctcggtca tcctgtccac tctgcagaac
     1561 ttcgtctgct ctgtgcccat cctcgcggtt agaaagaagc tgaaacgtgc caagcgcaag
     1621 ggcatcagcg agtgaccaat aatcactgca gtaattcctt tttagcaaca catacttata
     1681 tacagcaaca gaccttatgt cttttctctg ctccgatacg ttatccaccc aactttattt
     1741 cagttttggc agggggaatt tcacaacccg cacgctaaaa attgtattta aacttaaaaa
     1801 agaacagcca caaataggga actttggtct aaagcaagga ctctccctcc cttatcttga
     1861 ccgtgctatt gccatcactg ctacaagact aaatacgtac taatatatgt tttcggtaac
     1921 gagaagaaga gctgccggtg cagctgctgc catggccaca gccacgggga cgctgtactg
     1981 gatgactagc caaggtgata ggccgttagt gcacaatgac ccgagctaca tggtgcaatt
     2041 ccccaccgcc gctccaccgc aggtctctag a
//
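
Two quick consistency checks can be run on the IMP2 record above: the four BASE COUNT values should add up to the stated locus length (2071 bp), and the CDS span 697..1635 should be a whole number of codons. The Python sketch below illustrates both, assuming the CDS span includes its stop codon, as is usual for a complete CDS; `cds_protein_length` is a hypothetical helper name, not part of any GenBank tooling.

```python
def cds_protein_length(start: int, end: int) -> int:
    """Residues encoded by a 1-based, inclusive CDS span that includes its stop codon."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # One codon is the stop codon and encodes no residue.
    return length // 3 - 1


# Values copied from the BASE COUNT line of the SCIMP2 record.
base_count = {"a": 565, "c": 534, "g": 534, "t": 438}
total_bases = sum(base_count.values())

print(total_bases)                    # compare with "2071 bp" on the LOCUS line
print(cds_protein_length(697, 1635))  # compare with the /translation length
```

The same check applies to the P. pentosaceus record: its BASE COUNT values (6181, 4374, 5015, 6269) sum to the 21839 bp given on its LOCUS line.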
LOCUS       PPSURFOP    21839 bp    DNA             BCT       24-MAY-1994
DEFINITION  P. pentosaceus (PPE1.0) sucrose and raffinose operons.
ACCESSION   Z32771
NID         g493728
KEYWORDS    alpha-galactosidase; alpha-glucosidase; enzyme IIabc;
            fructokinase; insertion element; permease; regulator;
            sucrase.
SOURCE      Pediococcus pentosaceus.
  ORGANISM  Pediococcus pentosaceus
            Eubacteria; Firmicutes; Low G+C gram-positive bacteria;
            Lactobacillaceae; Pediococcus.
REFERENCE   1  (bases 1 to 21839)
  AUTHORS   Leenhouts,K.K.J., Bolhuis,A.A., Kok,J.J. and Venema,G.G.
  TITLE     The sucrose and raffinose operons of Pediococcus pentosaceus PPE1.0
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 21839)
  AUTHORS   Leenhouts,K.K.
  TITLE     Direct Submission
  JOURNAL   Submitted (27-APR-1994) Leenhouts K.K., Institute for Biological
            Sciences, Department of Genetics, Kerklaan 30, Haren, The
            Netherlands, 9751 NN
COMMENT     On May 26, 1994 this sequence version replaced gi:475962.
FEATURES             Location/Qualifiers
     source          1..21839
                     /organism="Pediococcus pentosaceus"
                     /strain="PPE1.0"
                     /db_xref="taxon:1255"
     repeat_unit     487..510
                     /note="DR2"
                     /citation=[1]
                     /rpt_type=DIRECT
     repeat_unit     488..561
                     /note="IR1"
                     /citation=[1]
                     /rpt_type=INVERTED
     repeat_unit     513..561
                     /note="DR1"
                     /citation=[1]
                     /rpt_type=DIRECT
     terminator      2724..2760
                     /standard_name="terminator 4"
                     /citation=[1]
     gene            complement(2929..3864)
                     /gene="rafR"
     CDS             complement(2929..3762)
                     /gene="rafR"
                     /standard_name="regulator"
                     /function="regulation raffinose-operon"
                     /citation=[1]
                     /codon_start=1
                     /transl_table=11
                     /product="regulator"
                     /db_xref="PID:g475963"
                     /db_xref="SWISS-PROT:P43465"
                     /translation="MNGEYKTLANKSFESNVLFFGQEACLPNYTYKGNNVRDSYVIHY
                     IQEGKGTFAAANHPATVLKAGDIFILPKGTPCFYQADNDQPWKYFWIGFSAGIRIEAM
                     LSGSLLAQKCYLRQVQNGHIYADLSELYKVLHIPNSLINDVLLGSLIYRLFYDLLRWY
                     PADATNIKVKSTEQFNLAVSYLQENYSTGCTIMDLCHYLNLSRSYLYTLFKTHANTSP
                     QKLLTKLRLEDAKQRLSTSNNSVQSIANMVGYKDSFTFSKAFKRYSGASPSYYRKSIG



                     I"
     -10_signal      complement(3837..3842)
                     /gene="rafR"
                     /citation=[1]
     -35_signal      complement(3859..3864)
                     /gene="rafR"
                     /citation=[1]
     -35_signal      3909..3914
                     /gene="rafP"
                     /citation=[1]
     gene            3909..5918
                     /gene="rafP"
     -10_signal      3932..3937
                     /gene="rafP"
                     /citation=[1]
     CDS             3993..5918
                     /gene="rafP"
                     /standard_name="permease"
                     /function="raffinose transport protein"
                     /citation=[1]
                     /codon_start=1
                     /transl_table=11
                     /product="permease"
                     /db_xref="PID:g475964"
                     /db_xref="SWISS-PROT:P43466"
                     /translation="MQEEHNYKWVGGRLIYGFGAKGNDAFYSILSGYLIIFITSHLFD
                     TGNKALDNRMVSLVTLIIMVLRIVELFIDPFIGNAIDRTKNSPGHFRPWVVVGGTVSS
                     IILLLLFTNLGGLYAKNAMIYLVVFAILYITMDIFYSFKDVGFWSMLPSLTTDSRERE
                     KTATFARLGSTIGGGLVGVLVMPAVIFFSAKATSTGDNRGWFIFALIICLIALISAWG
                     VGLGTREVDSDIRKNKQDTVGVMEIFKALAKNDQLLWAALAYLFYGVGINILGSLEVY
                     YFTYIMGKPKSFSILSIINIFLGLIATSLFPVLSKKFSRKGVFAGCLVFMLGGIAIFT
                     IAGSNLWLVLLAATMFGFPQQMVFLVVLMVITDSVEYGQLKLGHRDESLALSVRPLID
                     KFGGAISNGVVGQIAIISGMTTGATASSITAAGQLHFKLTMFAFPALMLLIAIGIFSK
                     QIFLTEEKHAEIVAELERTWRTKFDNTTDQVAEKVVTSLDLATPIAGQVIPLAQVNDP
                     TFAAGTLGDGFAIKPSDGRILAPFDATVRQVFTTRHAVGLVGDNGIVLLIHIGLGTVK
                     LRGTGFISYVEEGQHVQQGDELLEFWDPTIKQAGLDDTVIMTVTNSTEFTMMDWLVKP
                     GQAVKATDNILQLHTKA"
     gene            5976..8177
                     /gene="agaR"
     CDS             5976..8177
                     /gene="agaR"
                     /standard_name="alpha-galactosidase"
                     /citation=[1]
                     /codon_start=1
                     /transl_table=11
                     /evidence=experimental
                     /product="alpha-galactosidase"
                     /db_xref="PID:g475965"
                     /db_xref="SWISS-PROT:P43467"
                     /translation="MSLITVDQANRVFHLHNQTLSYIFAVEQGGTLSHLYFGGHVDHY
                     HGELRYPRVDRGFSGNLPGSTDRTFSRDTLPKEYSTAGEMDYHLPAAIVRHTDGANAL
                     YLVYQGYRIEAGKPKLSGLPAAFVEDETEAETLTIVLVDQVSQVEFDLQYTIYRDRPV
                     VTRSVQVCNQGDHAVNLEKVASMQIDFTDRQFETITLPGAHANERHPERGSINYGIQT
                     FGSLRGTSSHQMNPFLALVDHTTTEFSGDAYGFNLVYSGNHAFELEKDQLDQLHLMVG
                     INSYNFNWQLKAGATFQTPEVLMVYTNKGLNAMSQAYHHLIRERWRSEFKNQERPIV
                     VNNWEATFFDFNEAKLKPIVDEAKQLGIEMFVLDDGWFGHRDDDNSSLGDWQVDHRKF
                     PQGLNHFVKYVHEQGLKFGIWLEPEMISYDSKLYQQHPDYLMQVPGRSPSPSRNQYIL
                     DLGRQAVRNNIFDQLDQLLKSKQIDYIKWDMNRHLSDIYSVALPPERQGEVYHRYVLG
                     LYELLERLTTAYPHILFEGCSGGGGRFDAGMAYYMPQIWASDNTDAVARLTIQYGTSL
                     AYPISLATAHVSVSPNQQTGRETSMSTRSAVAASGVLGYELDLTQLSSADKQIVQKQV
                     VQYKQIRPLIQFGEFYRLKSPITSNQAAWMFVSPQQDEAIVMVFNLTSYAQPSLTKTK
                     LVGLNPKLNYQNIATKAIFGGDELMQLGFYDPWYQDYTTKVYHFKAVTEN"
     terminator      8265..8289
                     /standard_name="terminator 3"
                     /citation=[1]
     repeat_unit     9403..9426
                     /note="DR2"
                     /citation=[1]
                     /rpt_type=DIRECT
     gene            complement(9519..10424)
                     /gene="iso-IS30"
     CDS             complement(9519..10424)
                     /partial
                     /gene="iso-IS30"
                     /standard_name="IS-element"
                     /function="IS-30 (E.coli) like IS element"
                     /citation=[1]
                     /codon_start=1
                     /transl_table=11
                     /product="insertion element"
                     /db_xref="PID:g475966"
                     /translation="SDAVRTKRSASLKWCLPQTHLLGADLLSSITYSERIKIETFCEL
                     GLSNIQMGVRLNRSPSTISYELSRCQPYQAELAQTDAEYKRSQCGRKTKLSDELKQKI
                     LNHLRLSWSPGMIAHEFKLATKSIYNWLNQGRIGFSLNDLPEHGVRQRRNVDQRSKYN
                     QSLGRSIEQRPMMFNQRNRIGDFELDTWGPRGHSKAVLLTLIDRKSRFLWAYRLKDR
                     TTATVNEALTKFLTTFNGPVHSFTVDRGTEFSGLVSLESQYGIKTYYCHAYTPADVW
                     MNALIGIYWFILKGLVLSTLVLKI"
     repeat_unit     10396..10442
                     /note="DR1"
                     /citation=[1]
                     /rpt_type=DIRECT
     repeat_unit     10397..10443
                     /note="IR2"
                     /citation=[1]
                     /rpt_type=INVERTED
     terminator      10681..10716
                     /standard_name="terminator 2"
                     /citation=[1]
     gene            complement(10727..11593)
                     /gene="scrK"
     CDS             complement(10727..11593)
                     /gene="scrK"
                     /standard_name="fructokinase"
                     /function="fructose 6-phosphotransferase"
                     /citation=[1]
                     /codon_start=1
                     /transl_table=11
                     /product="fructokinase"
                     /db_xref="PID:g475967"
                     /db_xref="SWISS-PROT:P43468"
                     /translation="MLLGAIEAGGTKEVCATGAENGQVSDRISIPTTTPVETMTAVDD
                     YFTTHPVDAIGIGSFGPIGVNPHDPKYGYITTTPKPGWGDFDFLGHLKSQFNIPLYWT
                     TDVNEAAYGESMIGIAKDVPNSIYMTIGTGVGAGVISQNHIFNGRTHTELGHMRLNRL
                     PGDDFKSNCPYHDICLEGLAAGPAVGKRTGKAGKDIPVDDPVWPIITDYIAQACVNLT
                     VAFAPDKIILNGGVMNQRQLFPMIREKFAAYLNGYEEVPPLDDYIVPAGLGNNSGIAG
                     GLLLAQAALKNA"
     gene            complement(13852..15989)
                     /gene="scrA"
     CDS             complement(13852..15807)
                     /gene="scrA"
                     /standard_name="enzyme IIscr"
                     /function="enzyme II of the PTS system, sucrose specific"
                     /citation=[1]
                     /codon_start=1
                     /transl_table=11
                     /evidence=experimental
                     /product="enzyme IIabc"
                     /db_xref="PID:g475968"
                     /db_xref="SWISS-PROT:P43470"
                     /translation="MNHQEVADRVLNAIGKNNIQAAAHCATRLRLVIKDESKIDQQAL
                     DDDADVKGTFETNGQYQIIIGPGDVDKVYDALIVKTGLKEVTPDDIKAVAAAGQNKNP
                     LMDFLKVLSDIFIPIVPALVAGGLLMALrW
                     AMASAPFTFLPILLGFSATKRFGGNPYLGATMGMIMVLPSLVNGYSVATTMAAGKMVY
                     WNVFGLHVAQAGYQGQVLPVLGVAFILATLEKFFHKHIKGAFDFTFTPMFAIVITGFL
                     TFTIVGPVLRTVSDALTNGLVGLYNSTGWIGMGIFGLLYSAIVITGLHQTFPAIETQL
                     I^NVAKTGGSFIFPVASMANIGQGAATLAIFFATKSQKQKALTSSAGVSALLGITEPA
                     IFGVNLKMKFPFVFAAIASGIASAFLGLFHVLSVAMGPASVIGFISIASKSIPAFMLS
                     AVISFWAFIPTFIYAKRTLGDDRDQVKSPAPTSTVINVNDEIISAPVTGASESLKQV
                     NDQVFSAEIMGKGAAIVPSSDQWAPADGVITVTYDSHHAYGIKTTAGAEILIHLGLD
                     TVNLNGEHFTTNVQKGDTVHQGDLLGTFDIAALKAANYDPTVMLIVTNTANY7\NVERL
                     KVTNVQAGEQLVALTAPAASSVAATTV"
     -10_signal      complement(15872..15877)
                     /gene="scrA"
                     /citation=[1]
     -35_signal      complement(15894..15989)
                     /gene="scrA"
                     /citation=[1]
     gene            15993..17568
                     /gene="scrB"
     -35_signal      15993..15998
                     /gene="scrB"
                     /citation=[1]
     -10_signal      16017..16022
                     /gene="scrB"
                     /citation=[1]
     CDS             16063..17568
                     /gene="scrB"
                     /standard_name="sucrose-6-phosphate hydrolase"
                     /citation=[1]
                     /codon_start=1
                     /transl_table=11
                     /evidence=experimental
                     /product="sucrase"
                     /db_xref="PID:g475969"
                     /db_xref="SWISS-PROT:P43471"
                     /translation="MIWNRKTRYTPYEQWPATKLPQLVAQARQSKWRMQHHIQPTSGL
                     LNDPNGFSYFDGQWHLFYQVFPFGPVHGLKSWQHVTSKNLVDWHDEGLAIRPDTPYDS
                     HGAYTGTALPIDDQLFIMYTGNVRTADWQRESYQLGAWMDTDNHIKKLSRPLIAHAPA
                     GYTSSFRDPDLIRNDHGYYALIGAQTTTEIGAILVYFSKDLTTWTCQGELNVPANARG
                     YMIECPKSGLDRSTARLIVLSQGLSQATIPYQNIYPNMYLVADQLNIiAQAQFTEPHAL
                     TQLDDGFDVYATQAINAPDGRALAVSWIGLPEISYPTDRENWAHCLSLVKELTLKDGH
                     LYQNPVAAVDDLRTTAHDLVFEQQRATVAALNGSFELLLTVPADKTVTVNIADQQESG
                     QLQVTVDANHGQVMIDRRHTGNSFAEDYGQTRQVELTAHKTIKIRLIIDVSVFECYID
                     NGYSVMTGRFFLNATPSRLWQGDTTAVTGKVWEWRQSEHTGVDNNETKIK"
     gene            17549..18529
                     /gene="scrR"
     CDS             17549..18529
                     /gene="scrR"
                     /standard_name="regulator"
                     /function="regulator of the sucrose operon"
                     /citation=[1]
                     /codon_start=1
                     /transl_table=11
                     /product="regulator"
                     /db_xref="PID:g475970"
                     /db_xref="SWISS-PROT:P43472"
                     /translation="MKPKLNDVAKLAGVSATTVSRVINNHGYLSSQTKEKVFAAMREL
                     HYQPNNMARSLQGKNTRLIGVIFSDISNPFFGELVSRIEKILFAKNYKVILCNSADDP
                     QKERDYLQMLMANQVDGIIAGAHNLGIEEYQQYGLPIISFDRYLSDNIPIVSSDNYQG
                     GWLATQTLHQAGATNVAIFTGKSHAGSPTNGRREGYEAYLTAQQLTPHVHELPFELTP
                     ALK^IMEIKTIMTQHQYDGIFCSDDIlAALLVL^^VAQQLSLTVPEQLRWGYDGTALIRD
                     YHSELTTVEQPLADISTLLVSLLLQRIEDANCTLESKYTLPVKLIKGFTA"
     gene            18550..20223
                     /gene="agl"
     CDS             18550..20223
                     /gene="agl"
                     /standard_name="alpha-glucosidase"
                     /citation=[1]
                     /codon_start=1
                     /transl_table=11
                     /evidence=experimental
                     /product="alpha-glucosidase"
                     /db_xref="PID:g493729"
                     /db_xref="SWISS-PROT:P43473"
                     /translation="MAATIKWWQQAWYQVYPRSFQDTNHDGIGDLKGITAHLDYLKQ
                     LGIDVIWLNPIYRSPNDDNGYDISDYQQIAADFGTMADFDELLQAAHDRGLKIIMDLV
                     VNHTSDEHPRFKRSRQDRTNQYRDFYFWRSGNGKKAPNNWEAAFGGSAWQYDEQTQQY
                     YLHTFSTKQPDLWENPTLRESVYTMMTWWLNKGVDGFRMDVINQISKLPGLPDGPLK
                     PHSQFGDARVTNGPRVHEFLQEMNQEVLSQFDIMTVGETHGVTPADALKYAGADQHEL
                     DMVFEFQHLRLDNSQHGLGKWSTRKTPLVALKKVISDWQVGLEGRAWNSLFWNNHDTP
                     RAVSRFGDDRPAYRVRSAKMLATCLHLLQGTPYIYQGEELGMTDAHFTELASYRDIES
                     LSAYRDLVTERQLLSPADMMARLAAASRDNSRTPMQWDTEVNAGFSDAAPWLTVNPNY
                     RQINAAAALADPDSVWYYYQHLIQLRHQYPSVTLGSFELLWADDPQYSYMHGNGKADL
                     ASLLQFHSRDTVPTTGSISDPTAKCLISNYGEQQPNKLRPYEAWVYQLA"
     terminator      20341..20389
                     /citation=[1]
     gene            21090..21677
                     /gene="iso-IS3"
     CDS             21090..21677
                     /gene="iso-IS3"
                     /standard_name="IS element"
                     /function="IS3 (E.coli) and IS981 (L.lactis) like IS
                     element"
                     /citation=[1]
                     /codon_start=1
                     /transl_table=11
                     /product="insertion element"
                     /db_xref="PID:g475971"
                     /translation="MCRILGVSRAQYYRYRSPKPSKRRAEDADLKQRILRIFAEFKQR
                     YGVMKIHHELNLELQPLQLRCSPRRISRLMKELDIHSVTWKWKAASASKTKVEQRPN
                     LLKQDFSTTGLNQKWTADMTYIQTKRNGWCYLSTIMDLHSRRIIGYSFSKKMATDLVL
                     KTLESAVKNRTITGGPDYPYGFRITGYQRLITINV"
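
Several features in this table sit on the reverse strand and are written as `complement(start..end)`. The Python sketch below shows one way such a location string could be parsed and the matching subsequence pulled out of the ORIGIN sequence. It is a minimal illustration that handles only the two simple forms seen in this record; `parse_location` and `extract_feature` are hypothetical helper names, and the ten-base `toy` sequence is made up for the demonstration.

```python
import re
from typing import Tuple

_COMPLEMENT = str.maketrans("acgt", "tgca")
_FORWARD = re.compile(r"(\d+)\.\.(\d+)")
_REVERSE = re.compile(r"complement\((\d+)\.\.(\d+)\)")


def parse_location(text: str) -> Tuple[int, int, int]:
    """Return (start, end, strand) for 'N..M' or 'complement(N..M)'.

    Whitespace is stripped first, so OCR-damaged forms such as
    'complement (3837. .38 42)' are also accepted.
    """
    cleaned = re.sub(r"\s+", "", text)
    match = _REVERSE.fullmatch(cleaned)
    strand = -1
    if match is None:
        match = _FORWARD.fullmatch(cleaned)
        strand = 1
    if match is None:
        raise ValueError(f"unsupported location: {text!r}")
    return int(match.group(1)), int(match.group(2)), strand


def extract_feature(sequence: str, location: str) -> str:
    """Slice a 1-based inclusive span; reverse-complement it on the minus strand."""
    start, end, strand = parse_location(location)
    fragment = sequence[start - 1:end].lower()
    if strand == -1:
        fragment = fragment.translate(_COMPLEMENT)[::-1]
    return fragment


toy = "acgtacgtaa"  # made-up sequence, not taken from the record
print(parse_location("complement (3837. .38 42)"))
print(extract_feature(toy, "2..5"))
print(extract_feature(toy, "complement(2..5)"))
```

Joined or partial locations (for example `join(...)` or `<1..200`) are outside the scope of this sketch and would raise `ValueError`.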



BASE COUNT     6181 a   4374 c   5015 g   6269 t
ORIGIN
        1 gcatgcttga ctgggtgctt agccgcaaat gatcaggaga gaagcatcgc gactaggaaa
       61 ggagggcaac acggttgctt taatcaacaa agaccagttc gatagtttca tcttcctgtt
      121 tgatgttgac agataaatga cgattaaggt cggccaccag tttgcgtgcg aaggccatct
      181 tctcgtcgaa ttcggtcatg tgcgaagtaa acgggacgct ttggtaatta aagttttcgc
      241 cgatatagcc cggatcagtc acgtagcgca agttgttttc gtcgtggcga ctgcccggtt
      301 tcaggacggt atccaaatac tgatagagaa tcgttggatc gtagttcagc ggcttatcaa
      361 gggtcgtgga aagcttgtag ccggttgctt gaccagcgaa ttgttcgttg atttagactt
      421 gggtaaggct ttgatacaat ttcaaaacga tcactcctct tcttttcact tttagtatag
      481 cagttatggt agattgtaaa attaatccga acgctgatct tttttgtccg aacagcgttc
      541 ggattaattt tacaatctac cttttaccaa gcaattgatg aaatgctagc taacgctagt
      601 cctagtgata aattgtctaa acgcttgcaa gcaatctttg cccaagcatc tatttctcgg
      661 cgtgatatgt tgggtgcaat gattggcgcg gtatgaatat tgctacagct aaaagtgaat
      721 catttgcagt taatgaattc aaacagcagt atatcgatgg ctataataat ttaagcccta
      781 ataaactatc aacagttcca gataaagtag ccaaccaagc tgagtattct caacgaattt
      841 gggtgcatgg cgacgtcatg acgacacgaa tgaaagaaac cctgaacaag ggtttgagtc
      901 gccgcatgaa aaaatcagac atcaataaaa tgacccgcag tattcctcaa agcggtgatc
      961 gaattgatga caatttagca acgcctatga accaaatgct atccagaatt catacgttaa
     1021 caactaatga agctatacgc aatagcaacg gtggtaagaa tagaacgtat gatgaacaga
     1081 atgttcagta tgtgatgtgg ttaaccgagg aagattgatc gtgtttgtga tatttgtgaa
     1141 ccacttgata gacaaacatt tgcctatgga caagcaccga ttccagtcac ggatactcat
     1201 ccaaggtgtc gttgccagct ggtagcctgt gatgaagatg ggaatctgct gtctggtgaa
     1261 cttgataggg tatttaagga acaaaaaaag cccacttagt ttaaaactaa gtgggtcagg
     1321 aatattatta attggcttcg ctcacaacaa agtcgagggg tgctaactaa gttagcatct
     1381 ataatttagt ataagaacct taaatgaagc ttaaaaacta attctttgat ttaccgacaa
     1441 agaaagaaac tacagcaaca acaattactg ctccaataat tgaaggaata attgccattc
     1501 cggctaattg agggccccaa gatccgagta tagcttctcc aatagaggaa ccaactaatc
     1561 cggcaacaat attagcaatc caccccattg attttccttt gctagtaatt gctccagcaa
     1621 tagcaccaat gatggcacca ataattaaaa cccataacca atgcataaag catcatcctc
     1681 tcttattttt cgagggatgt tttttagatt tgttatagtt ttgaatttct ttgaaaatgg
     1741 ctaataaaag ttgagatata gaaatgataa ttgataattt tgaccaaagt ccaaatcttt
     1801 tcatatgaaa tcctccatta tctaaattgc attcttctaa acctaattat aggcaattga
     1861 atatttgcat acaagttatt cgattgtgcc attcttttta aaataggaat ttttgatttg
     1921 cctttttctc aattacagac gttaaagaat aattgattta gttattgcac ttacaaacaa
     1981 tttagtctac ggacttaaaa aggagttttg atgatgaaaa caaatttgca attttttgct
     2041 gatcctaacc cagaacctaa gccaaacgat tctaacccaa atcctgatgt gaccttttgg
     2101 gcgtggtgtg atagacggtt cttgcatcaa gacgaaaact tctttcattg tttgttgtag
     2161 gaatttcaat ggtacctgaa tttttcgtct tggtgtcttt tttttaccat tttagctaca
     2221 tggctaactt aattataaaa cgcgcgagaa taaatttttg cgagaaaaag gaattaaaaa
     2281 ccataataaa tttattctat tgaatttaaa tgactatcgt gttgctaagt taagatatcc
     2341 agtaactcaa gctagtttga atgaaatgtt aatcaaactt ggtaatgaag ccgatattac
     2401 tgcgcctaaa gagatgcagt ggtcgatgta ttcattgcgg catacagttg caacaagact
     2461 tgggaaaata cagccgggat gagttatccg tgggcggcag cgcgtatggg acatacgtta
     2521 gaagagttca tgcagactta tgtacatgtt gatcagaatc gttcgttaga aatgcttgct
     2581 aagtgggtat gattgactaa atttctcaaa cacacacacg aaacacacag aactcaaaga
     2641 taattaaata aacgcgattg ctgcgggatt tcgaagctca actttgtatc ttgcagacgc
     2701 ataaacaaaa taattcttat ttgcaaaagc cgacaatagc gcgttgattg tcggtttttg
     2761 tttgtttaaa aaaggctgta tcagttaata tcaacacaca gcaaaattga aaaaagtcgc
     2821 gcgttattac ggtggtacca taatttttac aggacaacta ttccttatgt aattcgatag
     2881 cattagcaga aagacaaatc acacgggcaa catagtggtc attatgattt agattccaat
     2941 gcttttacga taatagctgg ggctagcacc actgtagcgt ttaaatgcct tggagaaagt
     3001 gaagctatcc ttgtagccta ccatattggc aatgctttgc actgaattat tactagtgct
     3061 tagacgctgt ttagcatctt ctaagcgcaa tttagttagt aatttttgtg gcgaggtatt
     3121 agcgtgcgtt ttaaacaagg tgtagagata gcttcgtgat aaattaagat aatgacagag
     3181 gtccataatg gtacaaccag tactataatt ttcttgcaag taagaaacag ctaagttaaa
     3241 ttgttcagtt gactttactt ttatgttggt ggcgtcagcg ggataccagc gcaataagtc
     3301 atagaacaat cgatagatca aagaccctaa tagaacatca tttatcaatg aattagggat
     3361 gtgtaaaact ttataaagtt cagagagatc ggcgtaaata tggccatttt gtacttggcg
     3421 taagtagcat ttctgagcta ataaagaacc agaaagcatt gcttcgatac ggataccggc
     3481 ggaaaaaccg atccaaaagt atttccacgg ttggtcattg tcagcttgat agaaacaagg
     3541 tgttccctta ggtagtataa aaatatcgcc ggcttttaat acggttgctg gatggttagc
     3601 tgcagcaaag gtacctttac cctcctggat gtaatggata acatagctgt ctctgacatt
     3661 attacctttg taagtgtaat tcggtaggca agcctcttgc ccgaaaaaga gtacattact
     3721 ttcgaagctt ttattcgcta aagttttata ttcaccattc atatgagttc tccatttact
     3781 taaaatatct tgctaaatat gtgaaaatgt ttttcaacta attaataaat agctacacta
     3841 tactgctatt taaaaacatg tcaaattgtt tactaataaa aataacaaaa aattgataaa
     3901 gattactatg ggcaacccaa tagacaggaa ttataatgag ctttgttaat gaaaacggat
     3961 acaaaaatga tttgagggca ggtaattcat ttatgcagga agaacataat tataaatggg
     4021 taggaggtcg cctaatctat ggttttggtg caaaaggtaa tgatgctttc tatagtatcc
     4081 tctccggata tttaattatc ttcattactt cacacctttt tgataccggc aacaaagcac
     4141 tagataatcg aatggtcagt ttggtgacct tgattattat ggtgttacga atcgttgagc
     4201 tattcattga tccctttatc ggtaacgcaa ttgaccgaac taaaaactcg ccgggtcatt
     4261 tccggccatg ggttgtagtc ggtgggaccg tctcttcaat tattctcttg ctattattta
     4321 ctaatttagg cggcctctat gctaaaaacg caatgattta tttggtcgtt tttgcaattt
     4381 tgtatattac gatggatatt ttctattcgt ttaaagacgt tggtttttgg tcaatgttac
     4441 cttccttgac cactgattca cgagaacgcg aaaaaactgc tacttttgcc cgcttagggt
     4501 caactattgg tggtggcttg gttggtgtgt tggtcatgcc ggcagttatc ttcttttctg
     4561 ccaaggcgac tagtactggc gataatcgtg gttggttcat ttttgccttg atcatctgct
     4621 tgatagcttt aatttcagcc tggggtgttg gcttaggtac acgtgaagtt gatagtgata
     4681 ttcgtaaaaa taaacaagat acagttggtg tgatggagat ttttaaggca ctggctaaaa
     4741 atgatcagtt gctatgggca gctttagcct atttattcta tggtgtgggc attaatattt
     4801 taggttcact ggaagtctat tatttcacgt atattatggg taagccgaaa tcattttcta
     4861 ttctatcgat cattaatatt ttcttaggtt tgattgctac atcgctattt ccagtattat
     4921 cgaaaaaatt cagtcgtaaa ggtgtttttg ctggctgttt agtgttcatg ctaggtggta
     4981 ttgcgatttt taccattgct ggcagtaatt tatggctggt tctattagcg gcaactatgt
     5041 ttgggttccc gcaacagatg gtctttttag tagttttgat ggtaatcacg gattctgttg
     5101 aatacggtca attgaagctg gggcatcgtg atgagtcgct ggctttatca gtgcggccgt
     5161 tgatcgataa atttggtggc gctatctcta acggggttgt tggccaaatt gccattattt
     5221 ctggtatgac gactggtgcg actgcttctt ctatcactgc tgcaggacaa ttacatttta
     5281 aattaacgat gtttgccttt cccgcgttaa tgctacttat cgctattggc attttttcaa
     5341 agcaaatatt cttaaccgaa gaaaaacacg ccgaaattgt ggctgaactt gaaagaacct
     5401 ggagaacaaa gttcgataat accactgatc aggtagcaga aaaagtagtt acatcacttg
5461 atttagctac accaatcgct gggcaagtga ttccacttgc ccaagtcaat gatccaactt 
5521 ttgcggctgg aacgttaggt gacggatttg ctattaaacc tagtgatggc cgaatattag 
5581 ctccatttga tgcaacggta cgtcaagtat ttaccacacg acatgcagtt ggcttagtcg 
5641 gtgataatgg gatcgtctta ttgatccata ttggtttggg aactgttaaa cttagaggaa 
5701 cgggatttat ttcttatgtt gaggaggggc agcatgtaca acaaggggat gaattacttg 
5761 agttttggga tccaacgatc aaacaagctg ggttagatga tacggttatt atgacagtga 
5821 ctaattcaac cgaattcact atgatggatt ggttagtcaa gccaggtcaa gccgttaaag 
5881 caactgataa tattttacag ttgcatacta aagcataatg tgccactggt tatcatttaa 
5941 ttttaaaatt agtaattccc cgaggaggaa ataaaatgtc attaattacg gttgatcaag 
6001 cgaatcgcgt ttttcatttg cataatcaaa ctctttctta tatctttgca gttgagcagg 
6061 gcggtacttt aagtcattta tattttggtg ggcatgtgga ccattaccac ggtgaattgc 
6121 gctatccacg agtcgatcgt ggtttttccg gtaatttacc gggatcgact gatcgcactt 
6181 tttcacgtga tactttgccg aaagaataca gtacggctgg tgaaatggat tatcacttac 
6241 cagcagcaat cgttcgtcat actgatgggg ctaatgcatt atacttagtt tatcagggat 
6301 atcggatcga agcgggtaaa ccaaagttaa gcgggctacc ggcagctttt gtggaggatg 
6361 aaacagaggc agaaactttg actatcgtat tagttgatca agtaagccaa gtcgagtttg 
6421 atctacaata tacgatctac cgtgatcgac cagtggtcac gcgctcggtg caagtctgta 
6481 atcaaggtga tcatgcagtt aatttagaaa aagttgcttc aatgcaaatc gattttacgg 
6541 atcgtcagtt cgaaacaatt acattacctg gagcgcatgc caatgaacgc catccagagc 
6601 ggggttcgat caattatgga atacagactt ttggtagttt gcggggtact tccagtcatc 
6661 aaatgaatcc atttttggcc ttggttgatc acactacgac agagtttagc ggtgatgcat 
6721 atggttttaa cttggtctat tctggcaatc acgcatttga actggaaaag gatcaacttg 
6781 accaacttca tctaatggtc ggtattaata gttataattt caattggcaa cttaaagctg 
6841 gcgcgacttt ccaaacacca gaagtattaa tggtctatac aaataaaggt ttgaacgcga 
6901 tgagccaagc ctaccaccat ttgatccgtg aacgagtagt acgtagtgag tttaaaaatc 
6961 aagagcgtcc gattgtagtt aacaactggg aagcaacatt ctttgatttt aacgaagcaa 
7021 aattgaagcc gatcgtggat gaagccaagc agttaggcat tgaaatgttt gttttagatg 
7081 atggttggtt tggacatcgt gatgatgata attcttcatt aggtgactgg caagttgatc 
7141 atagaaaatt tccgcagggc ttaaatcatt ttgtgaaata tgtgcatgaa caaggcctta 
7201 aatttggtat ttggttagaa ccagaaatga tttcatatga ttcaaagcta tatcagcaac 
7261 atccagatta tttaatgcaa gttcctggcc gatcaccaag tccatcacgt aatcagtata 
7321 ttttagatct tggtcggcaa gccgtacgaa ataatatttt tgaccagctt gatcaacttc 
7381 taaaatcgaa gcaaattgat tatattaaat gggatatgaa tcgccacctt tcagatattt 
7441 attccgttgc tttaccacca gaacgccagg gtgaggtcta tcatcggtat gttttaggtc 
7501 tgtatgaatt attggaacgg ttgacgacag cttatccgca tatcttattt gaaggttgct 
7561 caggtggtgg cggtcgattt gatgctggga tggcttacta tatgccacag atttgggcta 
7621 gtgataacac tgatgcagtt gcgcgattga cgattcaata tgggactagt ttagcctatc 
7681 caatttcgtt agctaccgcc catgtctcag tgagccctaa tcagcaaaca ggtcgtgaaa 
7741 cgtcaatgtc gactagaagt gcagtagcgg caagcggtgt cttaggttat gaattggatc 
7801 taacacaact cagttcagcg gataaacaaa tagtccaaaa gcaagttgta cagtataaac 
7861 agattagacc actgattcaa tttggtgaat tttatcgctt aaaatcacca attactagca 
7921 atcaagcagc atggatgttt gtttcacctc aacaagatga agcgattgtc atggtattta 
7981 atctgacatc ctatgcccaa cccagtttga caaaaaccaa attagtcggt ttgaatccca 
8041 agctgaatta ccaaaatatt gctactaagg cgatatttgg tggtgatgaa ttgatgcaac 
8101 ttggcttcta tgatccagta gtctaccaag attacacaac aaaggtctat cattttaaag 
8161 cagtcactga gaactaactt ttacgtaaat tgcaatagta agttaaccag tgccggtaaa 
8221 gtacacagac gaaagggctc atgggagttc caggtggatc aggcggcgcg gcctttaccc 
8281 gcacgcgcta attcctttcg aaatacttca gctggtgtac agaacctaag taaacggcgc 
8341 gggagatggt tgagccgaga ttcagcttga cgaaccagac tgggtgtagc tagatcgagt 
8401 gattacccct taggaaagta acgtctgagt gattgtggac ctcattagtg gcgcgttcgg 
8461 ccgagctata agggtgggta taataagtct cggtaatacc ctcaagcgca gccgttagcg 
8521 tggtgaactc tgttccgtta tccgccgtaa ctgtccgcat aacgttaccg aattcagccg 
8581 tagtattttg catgcataag ccacagatag cgcgttttac gttcggtaaa cgttaatagg 
8641 acactctcgc taccagcacg tttgccaatg acagtatcga tctcaaaatg accaaactcg 
8701 tgacgcattt ctactcgctt tggccgttct tcaatactac gaccgaatag gcgctgatgt 
8761 tggtgcgagt ggtgctgggt catccgacgt tttgtttttt ccagcagatc aaggttacgg 
8821 atctccaaca gttgcgcatc gatatagttg tacaaagtct tcgtacagac cattttggcc 
8881 ggcttaaata accggtgctt acgggcataa ccaacggcag cgtccggcga ccagccctca 
8941 tggcagaact tatcatcaaa gtaggctaag aaggccgtca cttggctaaa ttttaatggg 
9001 cgtccgcagt tttgccgctt tatttcatag cgattctgag cggcttcagg actgtaaatt 
9061 tcaaagtagt gtttctgatt attaacgcgt ttaacttgcg tgactcgacc acgcttgagc 
9121 tcattattga cggtctggtg gcagacgcct aatcgaatcg ctatttgccg tactgagcat 
9181 ccttcaatgt gccaagcggc gatttaaccg cgttcgattt ccgataaatg gtgacctttt 
9241 aggcgtggtg tgatagactg ttcttgcatc aagacgaaaa cttctttcat tgtttgttgt 
9301 aggaacttca atgatacctg aatttttcgt cttggtgtct ttttttacca ttttggctgc 
9361 gtggctaact taattataaa acgcgcggat aattttgata attggtagat tgcaaattta 
9421 atccgaattt ttggacaaat ttgtcagcat aacctgatac ggtgtttgct agtcgagtat 
9481 tttaagcggt cgctggttaa tttggagtaa cgtcgtcgtt aaatcttgag tactaatgtg 
9541 ctcaaaacga gtccctttag gataaaaaca acgtaaattc cgattaaagc gttcattact 
9601 accacgtcag ctggagtata agcatggcag taataggtct taataccata ttgtgattca 
9661 agtgatacta gaccgctaaa ctcagtgcca cggtccaccg tgaagctgtg caccggacca 
9721 ttaaaagtgg ttaggaactt agttagggct tcattaacag tcgctgtcgt ccgatctttt 
9781 aaccggtagg cccaaaggaa ccgtgatttg cgatcgatta aagttaataa aactgcctta 
9841 ctatgcccac gaggaccaac gactgtatct agttcaaaat cgccgatgcg attacgttga 
9901 ttaaacatca tgggacgctg ttcaattgat cgccccaaag attgattata tttggatcgt 
9961 tggtcaacgt tacgccgttg gcgtacgcca tgttcaggta gatcattcaa ggagaaacca 
10021 attctcccct gatttagcca attataaata gatttagtag ctagtttaaa ttcgtgagca 
10081 atcattcctg gtgaccagct tagacgtaaa tggttgagaa ttttttgctt taactcatcg 
10141 ctcagcttag ttttccgacc acattgtgat cgcttgtatt cggcatctgt ttgtgctaat 
10201 tcagcctgat aaggttgaca tcgagataat tcataagaaa ttgttgacgg tgatcggttc 
10261 agccgaacgc ccatttggat attggacagc cctagttcac aaaaggtttc gattttaatt 
10321 cgttcggaat aggttatact agacaaaaga tcagctccta aaagatgggt ttgtggtaaa 
10381 caccatttta aggaagctga tctttttgtc cgaacagcgt cggattaatt ttacaatcta 
10441 ccatgtacgt gtcgtatatt ccgcaaatca tttcgaattt ctcaggtgac ccagtatcgc 
10501 cgctacaacc actcgtggca atgattaatg ggatattatg gactggttac ggctggttca 
10561 agacttataa ggattggccc gttattattt caaatgttcc cggggtgatt tttggattta 
10621 tcactgtttt aaccgtatat atgcattaaa cgaatgacaa attgactgta aactaaaagt 
10681 ccgctaatac gtttaaacac gtatcggcgg acttttttgc aaggtgttaa gcgttcttta 
10741 aagcagcctg agctaacaat aggccaccgg caatgccaga attgttaccc agtccagcgg 
10801 gaacgatata gtcatccagt ggtgggactt cttcgtagcc atttagatag gcggcaaact 
10861 tttcacgaat catgggaaag agctgccgtt gattcataac accaccgtta agaataattt 
10921 tatcaggtgc gaaggcgacg gtgaggttga cacatgcctg tgcgatgtag tcagtgatga 
10981 ttggccagac cggatcgtca acgggaatat ctttcccagc tttccccgtt cgttttccaa 
11041 ctgctggccc agcagctaaa ccttctaaac agatgtcatg gtaaggacag ttggacttga 
11101 aatcgtcccc aggtaaccgg ttgagtcgca tgtgaccaag ctcagtatga gtgcgaccat 
11161 taaaaatgtg gttctgacta atgacgccag cgccaacacc ggtaccaatc gtcatgtaga 
11221 tgctgttagg gacgtcctta gcgattccaa tcattgattc gccataagcg gcttcgttaa 
11281 catcagtggt ccagtagagt gggatgttga attgactttt aagatggcct aaaaagtcga 
11341 agtcacccca gccgggtttg ggagtcgtgg taatgtaacc atatttcgga tcatggggat 
11401 taacgccgat ggggccaaat gaaccgatac caatcgcgtc aacgggatga gtcgtaaaat 
11461 agtcgtctac cgcagtcatg gtttcaacgg gggttgtcgt ggggatcgaa attcggtcgc 
11521 taacttgccc gttttcagcg cccgtggcac aaacaaattt tgtaccacca gcttcaattg 
11581 cacctaaaag catagtgtgc ctcctgtagt taaaaatgta gtcttattta ataataatat 
11641 attcttaatt cgaaattttg ttaacgatta cattaatttt gacataaata aattagctga 
11701 ctagtgaaat cagcgttggt gggtttaagt ggaatcccca tcgtgtttaa ctgatgcccc 
11761 gtatagcgat tacccgatga atcgctatac gctaattcag catcgagata gtgtaatggt 
11821 aatgggtgct gagttttgac acgagaatta agcccattta agtaacagca aattgcctgc 
11881 tttttatctg ccgttactat caaccacgca acaacgtgtc ttgttgttgg tagacgataa 
11941 aagtgggcct tgataaaagt agatttcagt tgctgatatt taattaaggc tcctcgaaga 
12001 agttgctgtt cactaggagc aagttgctta ggattaagtt ccaatcctag ttgcccgatg 
12061 caggctaatt gtaatcgagt ttcaaatggt gttattcggc cattttgcgc attgggtgat 
12121 gcggttatat ggttgctgaa tatccgtggt ggaaatagat aactaaatcc gttctcgatg 
12181 gttgctcggt cgactggatc agtcaaatcg cttatccacg tctgattggt gtatggtagc 
12241 ataccaaaat caaaccggcc accaccagct gaacaatttt cgataatcag cttggggcag 
12301 gcgcgcttca agcgtgtgag gatatcataa agtccacaga catagcgatg atacagttcc 
12361 ccctgttgcg cagccggtaa gtgtgtgctg ccaacctgtg taaggtggcg attcatatcc 
12421 catttaagat aatcgagttg gttattttga accaagttcg tcagagtagt gattaagtgg 
12481 tcacggacag cggcctgcga aaggtctaag actagctggt ggcgagctgt gatgggtgtc 
12541 cgatcaacat attgtaatac ccagtcagga tgttgttggt agagctgact gttagtggtg 
12601 atcatctctg gttccaccca gagaccgaat ttcattcgat gatgatgggc ttgttgggca 
12661 agggggttca gaccattagg aaacttaatg gggtcaacaa accagtcacc taattgccca 
12721 ttttcgccgt tacggttgac aaaccagccg tcgtctaaga caagcatttg taagcctaac 
12781 tggtgggcgt gttcgattaa gtgttgaact ttagactccg aaacggcaaa tgtcagtgtt 
12841 tcccaagtgt taagcacact tggaatgtta gtctgggagg gcatcagctg atagctaaag 
12901 tcgtgaaata cttgcgacat gccgttaaaa ccagtatttg tccaagttag cacggcttca 
12961 ggagtctgaa aactgctatt tggtttgagc tgccattcga atgtgtcggg ctcaagaccg 
13021 attgttaggc ggctatgttg atattggtca acttcgacag tactgtcgaa gttaccactc 
13081 catgctagtg cacaaccaat aacagtccca gctaattctg tcgtgttagg ttctgctaga 
13141 gccatgaacg gttggtgttg tggtccacta gtgccgcgca gactgcgaac cgtttgtaac 
13201 cctgggtgta aacggttaaa cgatggatta gcttcatgtg catgggtacc gcttaatgtc 
13261 agcgcagtat attggtcagt tggtagatct agttgagcgc tggatagggc tgttacttgc 
13321 aaattagtag taccgtgatg gcgcagagtt gtgcttcgca agataagcgg ctgattttca 
13381 aaaatagtgt agttaagatc catctgtaac ttggtcactg catcggtcaa atgaatgact 
13441 agggttgtta caggcgtgtg tttgctaacg gtgggtggca attggcgcga gttgattggc 
13501 tgatcattca cagaaaaacc agtatattct aaaattggca gtaattgatt gttagcatcc 
13561 ttaataacgt aagctggctg gcgatagtca ccactgccaa tagtggaata ttcgagtggc 
13621 agtgaagtca cggaatatgg aaagcgttct gttgggtcat tagcaaaagc atggctaccg 
13681 gatggcatcg gctcaaagta cggttgtttt gaaaagtaac gtccaaaata tcgatgcact 
13741 gggtaacgat gagctaaaat ttgaataaca tagctaattt ggtcattatg aagattgatt 
13801 aagccggttt tggggtcata ttcaacaggc atgacgtcct cctaataaag tttagacagt 
13861 tgtagcagca actgagctag ctgccggagc agtcaacgca acgagctgtt caccagcttg 
13921 gacattcgtg actttgagtc gttcaacatt agcgtaattg gccgtatttg tgacgattag 
13981 catgacggtt ggatcatagt tagcggcttt taaggcagcg atgtcaaacg ttccgagtag 
14041 atcaccttga tgaacggtat cgcccttctg aacgttagta gtgaagtgtt caccatttaa 
14101 attgaccgta tctaatccta agtgaatcag aatttcggca ccagcagttg ttttgatgcc 
14161 ataagcatga tggctatcat aagtgacagt gatgaccccg tcagctggtg cgacgacctg 
14221 atcagagctt ggcacgattg cggcgccttt ccccatgatt tcggctgaga aaacttggtc 
14281 gttaacttgt ttgagacttt cgctagcccc cgttaccggt gcactgataa tctcatcatt 
14341 aacattgatg acggtactcg ttggtgctgg tgattttact tgatcacgat cgtcaccgag 
14401 tgttcgtttg gcatagataa aagttggaat aaatgcgacg acgaacgaga tgacagcact 
14461 gagcatgaat gctggaatcg actttgaagc aatcgaaatg aatccgatga ctgatgcggg 
14521 acccatcgcc acggacagaa cgtggaatag tcctaagaaa gctgaagcaa ttcctgaggc 
14581 aatcgctgca aagacaaatg gaaatttcat cttgaggttg accccgaaaa tagcaggttc 
14641 cgtaattccg agtaacgctg agaccccggc tgaagaagtc agggcttttt gcttttggct 
14701 cttagtggca aagaaaatag ctaaagtggc ggcaccttgc ccaatgttgg ccatcgaggc 
14761 gactgggaag ataaaggaac caccagtttt ggcaacattg gccaacaact gggtttcgat 
14821 tgctgggaaa gtttgatgga gaccagtaat aacaattgca gaatataata aaccaaagat 
14881 tcccatacca atccagccgg tgctgttata taagcctact aaaccgttag ttaatgcatc 
14941 gctcaccgtt cgtaagacgg gaccaacgat tgtaaaagtt aggaaaccag taatcacaat 
15001 ggcaaacatc ggggtaaacg tgaagtcaaa tgccccttta atgtgtttat ggaagaattt 
15061 ttcaagggta gctagaatga aggcgacgcc taagactggg agcacttggc cttgatagcc 
15121 ggcttgtgca acgtgtaacc caaagacgtt ccagtaaacc atcttgccgg ctgccatggt 
15181 cgttgcaacg ctatagccgt taactagtga tggtaagacc atgatcatgc ccatcgtagc 
15241 acccagatag gggttgccac cgaagcgctt cgttgctgaa aatcctaaga ggattggcaa 
15301 gaaagtaaac ggcgcactag ccatcgcgtt aatcatttcg gcgatacctt tgaggccagg 
15361 gtaaacttca acaactgact tcgccataaa aagatgctcg gcagtcaaaa cgttgtttag 
15421 tgccattaat agacccccag caactagtgc agggacgatt ggaataaaaa tatcagataa 
15481 gactttgagg aagtccatta atggattttt attttgaccc gcagcggcaa ctgccttgat 
15541 gtcatcgggt gtcacttctt taagacctgt tttaacgatt aaggcgtcat agactttatc 
15601 gacatcgcca gggccaataa tgatttggta ctggccgtta gtttcgaagg tccccttaac 
15661 gtccgcgtcg tcatctaagg cttgttgatc aatcttggat tcatccttga tgaccaaacg 
15721 gaggcgtgtc gcacagtggg cggcggcttg aatgttgttt ttaccaattg cattaagtac 
15781 gcggtcggca acttcttgat gattcataat aattcactcc ctgagtctaa aattattttt 
15841 aatttaataa tgcaagcgct ttacagaccc aataatagca attttggaaa atatgtcaag 
15901 cgtttgatac aaaattggat tgcttgataa tctggttaaa aagccgaatt tgaaacgctg 
15961 acatgaaaat ataatataaa aatgtcaaac gattgacata tcggtgtaag cggttttata 
16021 atagtgacaa ttaagtcgaa cgttaaaaag gagctcatcg tcatgatatg gaatcgtaaa 
16081 acccgttata ccccttatga acagtggcca gcaactaagc taccacagct cgttgcccag 
16141 gcacgccagt ctaagtggcg gatgcaacat catattcagc ctacatcagg tttgttaaat 
16201 gaccccaatg gcttttcata ctttgatggt cagtggcacc tgttctacca agtctttccg 
16261 ttcggtccgg tacacggact gaagtcatgg caacacgtga cctctaagaa cttggtggat 
16321 tggcatgatg agggcttagc gattcgacca gatacgccgt acgactcaca cggcgcgtac 
16381 acgggtacag cactaccaat tgatgatcag ttatttatta tgtatacggg taatgtccgt 
16441 actgccgact ggcaacgtga atcgtatcaa ttaggtgcct ggatggacac ggataatcac 
16501 attaaaaagc tgtcccggcc actaattgcg catgctccgg ctgggtacac gagttcgttt 
16561 cgtgatccgg acttgatccg gaatgaccac ggctactacg cgctgatcgg tgcgcagacg 
16621 actactgaga tcggtgctat cttggtttat ttctcaaagg atctgaccac ctggacgtgt 
16681 caaggcgagt tgaacgttcc cgcaaatgcg cggggttata tgattgagtg tccaaaatct 
16741 ggtctggatc gatcaacagc ccgtcttatt gttctgtccc aagggttatc gcaagcaacg 
16801 atcccgtatc agaatattta tccgaacatg tacttagttg ccgatcaatt gaacctagct 
16861 caggcacaat tcactgaacc ccacgccctg acgcaattgg atgacggctt tgacgtgtat 
16921 gccacgcagg caatcaatgc gccggatggc cgcgcgttag ccgtcagctg gattgggtta 
16981 cctgaaattt cctatccaac tgatcgtgaa aattgggccc attgtttgag cttagttaag 
17041 gaactaacgc ttaaagatgg gcacctctat caaaatccag tggcagcggt ggatgatttg 
17101 cggacgacgg cgcatgacct agtctttgag caacaacggg cgacggtggc cgctttgaat 
17161 ggcagctttg aactcttact gacggttccc gctgataaaa cggtcacggt taacattgct 
17221 gatcagcagg aaagtggcca gctgcaggtg actgttgatg ccaaccatgg tcaggttatg 
17281 attgatcggc ggcatacggg gaattcgttt gctgaagatt atggtcagac tcgtcaagtt 
17341 gagttaacgg cccataagac gataaaaatc cggctaatca tagatgtgtc agtctttgaa 
17401 tgctatattg ataacggcta ttccgtgatg accggccgtt tcttcttaaa cgctaccccg 
17461 agtcgtctta acgttcaagg tgacactacg gcagttaccg gtaaagtctg ggaatggcgc 
17521 caatcagagc atactggagt cgataacaat gaaaccaaaa ttaaatgatg ttgcgaagct 
17581 agcgggggtc tcagcgacaa ccgtttctcg ggttattaat aaccatggct acctgagtag 
17641 tcagactaaa gaaaaagttt tcgcggcgat gcgagaactg cattatcagc cgaataatat 
17701 ggcacgctca ctgcaaggaa agaatacgcg cttaattggc gtcatctttt cagatattag 
17761 taatccgttc tttggtgaac tggtgtcccg gattgaaaaa atcttatttg ctaaaaatta 
17821 taaggtgatt ttatgtaata gtgcggatga tccgcaaaaa gaacgcgact acttacaaat 
17881 gttgatggct aatcaggtcg acgggattat tgctggtgcc cacaatttag gtattgaaga 
17941 atatcaacaa tatggtcttc cgattatttc atttgaccgg tacttatccg acaatattcc 
18001 catcgttagt tcggataatt accagggggg ctggctagca acccagacat tgcatcaggc 
18061 gggggcgact aacgtggcta tttttactgg taagtcacac gctggttcgc caacgaatgg 
18121 tcgccgtgaa ggatatgaag cctacttgac tgcccaacag ctaacacccc atgtccatga 
18181 gttaccgttt gaattgaccc ctgcgctgaa gatgatggaa attaagacga ttatgacaca 
18241 acatcagtat gatggtattt tttgtagtga tgatttggcc gctttactag tgctcaacgt 
18301 cgcgcaacaa ctgtcattaa cggttcctga gcaattacgc gttgttggtt atgatggcac 
18361 ggcgctcata cgtgattatc attctgaatt aacaacagta gaacaaccgt tagcggatat 
18421 tagcacgtta ctggtttctc tactattgca gcggattgag gatgcgaatt gtacactgga 
18481 atcaaaatat acgttaccgg ttaagctcat caagggcttt accgcgtaac taaggagtta 
18541 atcgtaatta tggcagcaac aatcaagtgg tggcaacaag ccgttgtcta tcaggtctat 
18601 ccacggagtt ttcaagatac caatcatgat ggtattggtg acttaaaggg gatcacggcc 
18661 cacctagact acttaaagca attaggaatc gacgtcattt ggctcaatcc catttatcgt 
18721 tcacccaacg atgataatgg ttatgatatt agtgattatc aacaaattgc ggccgacttt 
18781 ggcactatgg ctgattttga tgaactattg caagccgccc atgaccgggg cttaaagatc 
18841 atcatggact tggttgttaa ccatacctct gatgagcatc ctaggttcaa acgcagtcgc 
18901 caggaccgca ccaatcaata tcgtgatttc tacttttggc gctcaggtaa cggcaaaaag 
18961 gcgcccaata attgggaggc ggcttttggt ggttcagcat ggcaatatga tgagcagacc 
19021 cagcagtatt atttacacac gttttcaaca aaacaacctg atttaaactg ggagaatccg 
19081 accttacgag agtcagttta tacgatgatg acctggtggc tgaacaaggg tgttgacggt 
19141 tttcgaatgg acgtcattaa ccaaatctct aagttacccg gattaccaga tggtccactg 
19201 aaaccgcaca gtcagtttgg cgatgctcga gtaacgaacg gaccgcgagt tcatgaattc 
19261 ttgcaagaaa tgaaccaaga agtattgtca cagtttgata tcatgacggt tggtgagacc 
19321 cacggggtga caccagcgga tgcgctgaag tatgcgggcg ctgatcagca cgaattagat 
19381 atggtctttg aatttcaaca tctgcgactt gataatagtc agcatggtct tggtaagtgg 
19441 agtacgcgta agacaccgtt agtggcgcta aagaaagtca ttagtgactg gcaagtcgga 
19501 cttgaagggc gggcttggaa cagcctgttt tggaacaatc atgatacgcc acgagcggtc 
19561 tcacgttttg gtgatgaccg accagcatat cgggtacgtt cggctaaaat gctcgcaacg 
19621 tgtttacacc tcttacaggg aaccccatat atttaccaag gtgaggaact ggggatgact 
19681 gacgctcatt ttactgaact agccagttac cgagatattg aatcgctcag tgcctatcgt 
19741 gatttagtga ctgagcgcca gctgctatca ccggcagata tgatggcccg attagcggcg 
19801 gcttctcggg acaactcgcg aacgccgatg caatgggaca cggaggttaa cgcgggtttt 
19861 agtgacgcag cgccatggct cacggtcaat cccaactatc gtcaaatcaa tgctgccgca 
19921 gctttggctg atccggattc agtgtggtac tattatcagc acttgattca attacgacat 
19981 caatacccct cagtgacgtt aggttcgttt gagctcttgt gggccgatga tccgcagtat 
20041 tcttatatgc acggcaatgg gaaggcggac ctggctagtt tgttgcaatt tcacagccga 
20101 gacactgtcc cgaccactgg atcaatatct gaccccacgg ccaaatgctt gattagtaat 
20161 tatggcgagc agcagccaaa taaattacgg ccttatgagg cttgggtcta ccaactcgca 
20221 taggtattca ggacttggct aattgaggtg gtctttgaaa tatacccaat cgttacgatt 
20281 agtgtgaatt gggttgttca aaattctgcg atggtcttaa taagcgacca cctgactaaa 
20341 aaaactgctt tcaaaaggat aacttttgaa agcagttttt ttagccgcgg cgaccacttg 
20401 gcccacgtga ggctcatcta gtcgttggtg accttgaatg aagcccgtga ggttaaacgg 
20461 tagttgttca tcgatcgtcg tgaccaattg accagtatct agcaacttga atagtaattg 
20521 taaggcggct tgatcagaaa tcacttcggt cggttggatt gcttggaaac gaattgattt 
20581 actgttggct ggcaagtgac gggcgaatga agcgatggtc gcatcaaatt tggccattgc 
20641 taagtcatct ttgctccaac gccatcgata gcgacattga ttacgacgtc accgcggtct 
20701 gggtctggat gtaagctgat tcctgagaca acttttaaga gagggtataa tgaataaatc 
20761 gtctctcaaa agaaaggaat ttttacctgc caactcgtta cgacaaagaa ttcaaacaaa 
20821 acattatcaa cctatataag caaggcgaat cagctgccca actggccaga gaatatggca 
20881 ttggctattc aacagttcat aagtggatcc agggccaggc caaaactcaa tccggtaaat 
20941 cgccagacga aatcaaagcg atggaaaagc gacgggcttc gctgtctgag gagaacgaaa 
21001 tcctaaaaaa gccctgggct tccttgcgca gaagtaacca atatctttga ttacgttcac 
21061 caagaagcca tcaccaccag gtaaccaaga tgtgccgaat cctcggtgtt tccagagctc 
21121 agtattatcg ttatcgatcc cccaaacctt caaaacgccg ggccgaagat gcggacttga 
21181 aacaacggat tctgcggatc tttgcggaat ttaagcagcg atacggtgtt atgaagatcc 
21241 accatgaatt gaatctggaa cttcaaccac tgcagcttcg gtgcagtcca cgacggattt 
21301 cccggctcat gaaggaactg gatatccact ccgttaccgt caataagtgg aaagcggctt 
21361 cggcttccaa aaccaaggtt gaacagcgtc ccaacttgct taagcaggat ttctcgacca 
21421 ctggtttaaa tcaaaaatgg accgctgata tgacctatat tcaaacgaag cgtaatggct 
21481 ggtgttactt atcaaccatc atggacctgc actcacgacg gattatcggc tattcgttct 
21541 caaaaaagat ggctactgat ttagtcttaa agaccctgga aagcgcggtt aaaaatcgaa 
21601 ccattactgg tggacctgat tatccatacg gatttaggat cacaggatac cagcgattga 
21661 ttacaatcaa cgtttaactg agctacatat ccgccactca tacagccgta aggttatccg 
21721 tatgataatg cgccaataga atcctttcac gcttccctca aaaaggaatg tgtttatcca 
21781 gtgccggtct ttgaagatta tgaaactgtc gctgccgtcc tttttgaata tgtgcatgc 
// 



LOCUS       PDCRAFOPER 21838 bp DNA BCT 23-MAY-1994
DEFINITION  Pediococcus pentosaceus raffinose operon genes.
ACCESSION   L32093
NID         g493181
KEYWORDS    agaR gene; agaS gene; agl gene; alpha-galactosidase;
            alpha-glucosidase; fructokinase; insertion element; permease; rafP
            gene; rafR gene; raffinose operon; regulatory protein; scrA gene;
            scrB gene; scrK gene; scrR gene; sucrase; sucrose-6-phosphate;
            transport protein.
SOURCE      Pediococcus pentosaceus (strain PPE1.0) DNA; Insertion sequence
            IS30 homolog (transposable element Insertion sequence IS30 homolog,
            kingdom Prokaryotae) DNA; Pediococcus pentosaceus (strain PPE1.0)
            DNA; Insertion sequence IS3 homolog (transposable element Insertion
            sequence IS3 homolog, kingdom Prokaryotae) DNA; and Pediococcus
            pentosaceus (strain PPE1.0) DNA.
  ORGANISM  Pediococcus pentosaceus
            Eubacteria; Firmicutes; Low G+C gram-positive bacteria;
            Lactobacillaceae; Pediococcus.
REFERENCE   1 (bases 1 to 21838)
  AUTHORS   Leenhouts,K.J., Bolhuis,A.A., Kok,J.J. and Venema,G.G.
  TITLE     The sucrose and raffinose operons of Pediococcus pentosaceus PPE1.0
  JOURNAL   Unpublished (1994)
COMMENT     On May 25, 1994 this sequence version replaced gi:475106.
FEATURES             Location/Qualifiers
     source          1..21838
                     /organism="Pediococcus pentosaceus"
                     /strain="PPE1.0"
                     /db_xref="taxon:1255"
     repeat_unit     487..510
                     /note="DR2; putative"
                     /rpt_type=direct
     repeat_unit     488..561
                     /note="IR1; putative"
                     /rpt_type=inverted
     repeat_unit     513..561
                     /note="DR1; putative"
                     /rpt_type=direct
     terminator      2724..2760
                     /standard_name="terminator 4"
                     /note="putative"
     gene            complement(2929..3864)
                     /gene="rafR"
     CDS             complement(2929..3762)
                     /gene="rafR"
                     /note="putative"
                     /codon_start=1
                     /transl_table=11
                     /function="regulation raffinose-operon"
                     /product="regulatory protein"
                     /db_xref="PID:g475107"
                     /translation="MNGEYKTLANKSFESNVLFFGQEACLPNYTYKGNNVRDSYVIHY
                     IQEGKGTFAAANHPATVLKAGDIFILPKGTPCFYQADNDQPWKYFWIGFSAGIRIEAM
                     LSGSLLAQKCYLRQVQNGHIYADLSELYKVLHIPNSLINDVLLGSLIYRLFYDLLRWY
                     PADATNIKVKSTEQFNLAVSYLQENYSTGCTIMDLCHYLNLSRSYLYTLFKTHANTSP
                     QKLLTKLRLEDAKQRLSTSNNSVQSIANMVGYKDSFTFSKAFKRYSGASPSYYRKSIG
                     I"
     -10_signal      complement(3837..3842)
                     /gene="rafR"
     -35_signal      complement(3859..3864)
                     /gene="rafR"
     gene            3909..5918
                     /gene="rafP"
     -35_signal      3909..3914
                     /gene="rafP"
                     /note="putative"
     -10_signal      3932..3937
                     /gene="rafP"
                     /note="putative"
     CDS             3993..5918
                     /gene="rafP"
                     /note="putative"
                     /codon_start=1
                     /transl_table=11
                     /function="raffinose transport protein"
                     /product="permease"
                     /db_xref="PID:g475108"
                     /translation="MQEEHNYKWVGGRLIYGFGAKGNDAFYSILSGYLIIFITSHLFD
                     TGNKALDNRMVSLVTLIIMVLRIVELFIDPFIGNAIDRTKNSPGHFRPWVVVGGTVSS
                     IILLLLFTNLGGLYAKNAMIYLVVFAILYITMDIFYSFKDVGFWSMLPSLTTDSRERE
                     KTATFARLGSTIGGGLVGVLVMPAVIFFSAKATSTGDNRGWFIFALIICLIALISAWG
                     VGLGTREVDSDIRKNKQDTVGVMEIFKALAKNDQLLWAALAYLFYGVGINILGSLEVY
                     YFTYIMGKPKSFSILSIINIFLGLIATSLFPVLSKKFSRKGVFAGCLVFMLGGIAIFT
                     IAGSNLWLVLLAATMFGFPQQMVFLVVLMVITDSVEYGQLKLGHRDESLALSVRPLID
                     KFGGAISNGVVGQIAIISGMTTGATASSITAAGQLHFKLTMFAFPALMLLIAIGIFSK
                     QIFLTEEKHAEIVAELERTWRTKFDNTTDQVAEKVVTSLDLATPIAGQVIPLAQVNDP
                     TFAAGTLGDGFAIKPSDGRILAPFDATVRQVFTTRHAVGLVGDNGIVLLIHIGLGTVK
                     LRGTGFISYVEEGQHVQQGDELLEFWDPTIKQAGLDDTVIMTVTNSTEFTMMDWLVKP
                     GQAVKATDNILQLHTKA"
     gene            5976..8177
                     /gene="agaR"
     CDS             5976..8177
                     /gene="agaR"
                     /codon_start=1
                     /transl_table=11
                     /evidence=experimental
                     /product="alpha-galactosidase"
                     /db_xref="PID:g475109"
                     /translation="MSLITVDQANRVFHLHNQTLSYIFAVEQGGTLSHLYFGGHVDHY
                     HGELRYPRVDRGFSGNLPGSTDRTFSRDTLPKEYSTAGEMDYHLPAAIVRHTDGANAL
                     YLVYQGYRIEAGKPKLSGLPAAFVEDETEAETLTIVLVDQVSQVEFDLQYTIYRDRPV
                     VTRSVQVCNQGDHAVNLEKVASMQIDFTDRQFETITLPGAHANERHPERGSINYGIQT
                     FGSLRGTSSHQMNPFLALVDHTTTEFSGDAYGFNLVYSGNHAFELEKDQLDQLHLMVG
                     INSYNFNWQLKAGATFQTPEVLMVYTNKGLNAMSQAYHHLIRERVVRSEFKNQERPIV
                     VNNWEATFFDFNEAKLKPIVDEAKQLGIEMFVLDDGWFGHRDDDNSSLGDWQVDHRKF
                     PQGLNHFVKYVHEQGLKFGIWLEPEMISYDSKLYQQHPDYLMQVPGRSPSPSRNQYIL
                     DLGRQAVRNNIFDQLDQLLKSKQIDYIKWDMNRHLSDIYSVALPPERQGEVYHRYVLG
                     LYELLERLTTAYPHILFEGCSGGGGRFDAGMAYYMPQIWASDNTDAVARLTIQYGTSL
                     AYPISLATAHVSVSPNQQTGRETSMSTRSAVAASGVLGYELDLTQLSSADKQIVQKQV
                     VQYKQIRPLIQFGEFYRLKSPITSNQAAWMFVSPQQDEAIVMVFNLTSYAQPSLTKTK
                     LVGLNPKLNYQNIATKAIFGGDELMQLGFYDPVVYQDYTTKVYHFKAVTEN"
     terminator      8265..8289
                     /standard_name="terminator 3"
                     /note="putative"
     repeat_unit     9403..9426
                     /note="DR2; putative"
                     /rpt_type=direct
     repeat_unit     10396..10442
                     /note="DR1; putative"
                     /rpt_type=direct
     repeat_unit     10397..10443
                     /note="IR2; putative"
                     /rpt_type=inverted
     terminator      10681..10716
                     /standard_name="terminator 2"
                     /note="putative"
     gene            complement(10727..11593)
                     /gene="scrK"
     CDS             complement(10727..11593)
                     /gene="scrK"
                     /note="putative"
                     /codon_start=1
                     /transl_table=11
                     /function="fructose 6-phosphotransferase"
                     /product="fructokinase"
                     /db_xref="PID:g475110"
                     /translation="MLLGAIEAGGTKFVCATGAENGQVSDRISIPTTTPVETMTAVDD
                     YFTTHPVDAIGIGSFGPIGVNPHDPKYGYITTTPKPGWGDFDFLGHLKSQFNIPLYWT
                     TDVNEAAYGESMIGIAKDVPNSIYMTIGTGVGAGVISQNHIFNGRTHTELGHMRLNRL
                     PGDDFKSNCPYHDICLEGLAAGPAVGKRTGKAGKDIPVDDPVWPIITDYIAQACVNLT
                     VAFAPDKIILNGGVNNQRQLFPMIREKFAAYLNGYEEVPPLDDYIVPAGLGNNSGIAG
                     GLLLAQAALKNA"
     gene            complement(11673..13832)
                     /gene="agaS"
     CDS             complement(11673..13832)
                     /gene="agaS"
                     /note="putative"
                     /codon_start=1
                     /transl_table=11
                     /product="alpha-galactosidase"
                     /db_xref="PID:g475111"
                     /translation="MPVEYDPKTGLINLHNDQISYVIQILAHRYPVHRYFGRYFSKQP
                     YFEPMPSGSHAFANDPTERFPYSVTSLPLEYSTIGSGDYRQPAYVIKDANNQLLPILE
                     YTGFSVNDQPINSRQLPPTVSKHTPVTTLVIHLTDAVTKLQMDLNYTIFENQPLILRS
                     TTLRHHGTTNLQVTALSSAQLDLPTDQYTALTLSGTHAHEANPSFNRLHPGLQTVRSL
                     RGTSGPQHQPFMALAEPNTTELAGTVIGCALAWSGNFDSTVEVDQYQHSRLTIGLEPD
                     TFEWQLKPNSSFQTPEAVLTWTNTGFNGMSQVFHDFSYQLMPSQTNIPSVLNTWETLT
                     FAVSESKVQHLIEHAHQLGLQMLVLDDGWFVNRNGENGQLGDWFVDPIKFPNGLNPLA
                     QQAHHHRMKFGLWVEPEMITTNSQLYQQHPDWVLQYVDRTPITARHQLVLDLSQAAVR
                     DHLITTLTNLVQNNQLDYLKWDMNRHLTQVGSTHLPAAQQGELYHRYVCGLYDILTRL
                     KRACPKLIIENCSAGGGRFDFGMLPYTNQTWISDLTDPVDRATIENGFSYLFPPRIFS
                     NHITASPNAQNGRITPFETRLQLACIGQLGLELNPKQLAPSEQQLLRGALIKYQQLKS
                     TFIKAHFYRLPTTRHVVAWLIVTADKKQAICCYLNGLNSRVKTQHPLPLHYLDAELAY
                     SDSSGNRYTGHQLNTMGIPLKPTNADFTSQLIYLCQN"
     gene            complement(13852..15989)
                     /gene="scrA"
     CDS             complement(13852..15807)
                     /gene="scrA"
                     /note="enzyme IIabc"
                     /codon_start=1
                     /transl_table=11
                     /function="enzyme II of the PTS system, sucrose specific"
                     /evidence=experimental
                     /db_xref="PID:g475112"
                     /translation="MNHQEVADRVLNAIGKNNIQAAAHCATRLRLVIKDESKIDQQAL
                     DDDADVKGTFETNGQYQIIIGPGDVDKVYDALIVKTGLKEVTPDDIKAVAAAGQNKNP
                     LMDFLKVLSDIFIPIVPALVAGGLLMALNNVLTAEHLFMAKSVVEVYPGLKGIAEMIN
                     AMASAPFTFLPILLGFSATKRFGGNPYLGATMGMIMVLPSLVNGYSVATTMAAGKMVY
                     WNVFGLHVAQAGYQGQVLPVLGVAFILATLEKFFHKHIKGAFDFTFTPMFAIVITGFL
                     TFTIVGPVLRTVSDALTNGLVGLYNSTGWIGMGIFGLLYSAIVITGLHQTFPAIETQL
                     LANVAKTGGSFIFPVASMANIGQGAATLAIFFATKSQKQKALTSSAGVSALLGITEPA
                     IFGVNLKMKFPFVFAAIASGIASAFLGLFHVLSVAMGPASVIGFISIASKSIPAFMLS
                     AVISFVVAFIPTFIYAKRTLGDDRDQVKSPAPTSTVINVNDEIISAPVTGASESLKQV
                     NDQVFSAEIMGKGAAIVPSSDQVVAPADGVITVTYDSHHAYGIKTTAGAEILIHLGLD
                     TVNLNGEHFTTNVQKGDTVHQGDLLGTFDIAALKAANYDPTVMLIVTNTANYANVERL
                     KVTNVQAGEQLVALTAPAASSVAATTV"
     -10_signal      complement(15872..15877)
                     /gene="scrA"
     -35_signal      complement(15894..15989)
                     /gene="scrA"
     -35_signal      15993..15998
                     /gene="scrB"
     gene            15993..17568
                     /gene="scrB"
     -10_signal      16017..16022
                     /gene="scrB"
     CDS             16063..17568
                     /gene="scrB"
                     /standard_name="sucrose-6-phosphate hydrolase"
                     /codon_start=1
                     /transl_table=11
                     /evidence=experimental
                     /product="sucrase"
                     /db_xref="PID:g475113"
                     /translation="MIWNRKTRYTPYEQWPATKLPQLVAQARQSKWRMQHHIQPTSGL
                     LNDPNGFSYFDGQWHLFYQVFPFGPVHGLKSWQHVTSKNLVDWHDEGLAIRPDTPYDS
                     HGAYTGTALPIDDQLFIMYTGNVRTADWQRESYQLGAWMDTDNHIKKLSRPLIAHAPA
                     GYTSSFRDPDLIRNDHGYYALIGAQTTTEIGAILVYFSKDLTTWTCQGELNVPANARG
                     YMIECPKSGLDRSTARLIVLSQGLSQATIPYQNIYPNMYLVADQLNLAQAQFTEPHAL
                     TQLDDGFDVYATQAINAPDGRALAVSWIGLPEISYPTDRENWAHCLSLVKELTLKDGH
                     LYQNPVAAVDDLRTTAHDLVFEQQRATVAALNGSFELLLTVPADKTVTVNIADQQESG
                     QLQVTVDANHGQVMIDRRHTGNSFAEDYGQTRQVELTAHKTIKIRLIIDVSVFECYID
                     NGYSVMTGRFFLNATPSRLNVQGDTTAVTGKVWEWRQSEHTGVDNNETKIK"
     gene            17549..18529
                     /gene="scrR"
     CDS             17549..18529
                     /gene="scrR"
                     /note="putative"
                     /codon_start=1
                     /transl_table=11
                     /function="regulator of the sucrose operon"
                     /product="regulatory protein"
                     /db_xref="PID:g475114"
                     /translation="MKPKLNDVAKLAGVSATTVSRVINNHGYLSSQTKEKVFAAMREL
                     HYQPNNMARSLQGKNTRLIGVIFSDISNPFFGELVSRIEKILFAKNYKVILCNSADDP
                     QKERDYLQMLMANQVDGIIAGAHNLGIEEYQQYGLPIISFDRYLSDNIPIVSSDNYQG
                     GWLATQTLHQAGATNVAIFTGKSHAGSPTNGRREGYEAYLTAQQLTPHVHELPFELTP
                     ALKMMEIKTIMTQHQYDGIFCSDDLAALLVLNVAQQLSLTVPEQLRVVGYDGTALIRD
                     YHSELTTVEQPLADISTLLVSLLLQRIEDANCTLESKYTLPVKLIKGFTA"
     gene            18550..20223
                     /gene="agl"
     CDS             18550..20223
                     /gene="agl"
                     /codon_start=1
                     /transl_table=11
                     /evidence=experimental
                     /product="alpha-glucosidase"
                     /db_xref="PID:g493182"
                     /translation="MAATIKWWQQAVVYQVYPRSFQDTNHDGIGDLKGITAHLDYLKQ
                     LGIDVIWLNPIYRSPNDDNGYDISDYQQIAADFGTMADFDELLQAAHDRGLKIIMDLV
                     VNHTSDEHPRFKRSRQDRTNQYRDFYFWRSGNGKKAPNNWEAAFGGSAWQYDEQTQQY
                     YLHTFSTKQPDLNWENPTLRESVYTMMTWWLNKGVDGFRMDVINQISKLPGLPDGPLK
                     PHSQFGDARVTNGPRVHEFLQEMNQEVLSQFDIMTVGETHGVTPADALKYAGADQHEL
                     DMVFEFQHLRLDNSQHGLGKWSTRKTPLVALKKVISDWQVGLEGRAWNSLFWNNHDTP
                     RAVSRFGDDRPAYRVRSAKMLATCLHLLQGTPYIYQGEELGMTDAHFTELASYRDIES
                     LSAYRDLVTERQLLSPADMMARLAAASRDNSRTPMQWDTEVNAGFSDAAPWLTVNPNY
                     RQINAAAALADPDSVWYYYQHLIQLRHQYPSVTLGSFELLWADDPQYSYMHGNGKADL
                     ASLLQFHSRDTVPTTGSISDPTAKCLISNYGEQQPNKLRPYEAWVYQLA"
     terminator      20341..20389
                     /standard_name="terminator 1"
                     /note="putative"

BASE COUNT 6181 a 4373 c 5015 g 6269 t 

ORIGIN 

1 gcatgcttga ctgggtgctt agccgcaaat gatcaggaga gaagcatcgc gactaggaaa 
61 ggagggcaac acggttgctt taatcaacaa agaccagttc gatagtttca tcttcctgtt
121 tgatgttgac agataaatga cgattaaggt cggccaccag tttgcgtgcg aaggccatct 
181 tctcgtcgaa ttcggtcatg tgcgaagtaa acgggacgct ttggtaatta aagttttcgc 
241 cgatatagcc cggatcagtc acgtagcgca agttgttttc gtcgtggcga ctgcccggtt 
301 tcaggacggt atccaaatac tgatagagaa tcgttggatc gtagttcagc ggcttatcaa 
361 gggtcgtgga aagcttgtag ccggttgctt gaccagcgaa ttgttcgttg atttagactt 
421 gggtaaggct ttgatacaat ttcaaaacga tcactcctct tcttttcact tttagtatag 
481 cagttatggt agattgtaaa attaatccga acgctgatct tttttgtccg aacagcgttc 
541 ggattaattt tacaatctac cttttaccaa gcaattgatg aaatgctagc taacgctagt 
601 cctagtgata aattgtctaa acgcttgcaa gcaatctttg cccaagcatc tatttctcgg 
661 cgtgatatgt tgggtgcaat gattggcgcg gtatgaatat tgctacagct aaaagtgaat 
721 catttgcagt taatgaattc aaacagcagt atatcgatgg ctataataat ttaagcccta 
781 ataaactatc aacagttcca gataaagtag ccaaccaagc tgagtattct caacgaattt 
841 gggtgcatgg cgacgtcatg acgacacgaa tgaaagaaac cctgaacaag ggtttgagtc 
901 gccgcatgaa aaaatcagac atcaataaaa tgacccgcag tattcctcaa agcggtgatc 
961 gaattgatga caatttagca acgcctatga accaaatgct atccagaatt catacgttaa 
1021 caactaatga agctatacgc aatagcaacg gtggtaagaa tagaacgtat gatgaacaga 
1081 atgttcagta tgtgatgtgg ttaaccgagg aagattgatc gtgtttgtga tatttgtgaa 
1141 ccacttgata gacaaacatt tgcctatgga caagcaccga ttccagtcac ggatactcat 
1201 ccaaggtgtc gttgccagct ggtagcctgt gatgaagatg ggaatctgct gtctggtgaa 
1261 cttgataggg tatttaagga acaaaaaaag cccacttagt ttaaaactaa gtgggtcagg
1321 aatattatta attggcttcg ctcacaacaa agtcgagggg tgctaactaa gttagcatct 
1381 ataatttagt ataagaacct taaatgaagc ttaaaaacta attctttgat ttaccgacaa 
1441 agaaagaaac tacagcaaca acaattactg ctccaataat tgaaggaata attgccattc 
1501 cggctaattg agggccccaa gatccgagta tagcttctcc aatagaggaa ccaactaatc 
1561 cggcaacaat attagcaatc caccccattg attttccttt gctagtaatt gctccagcaa 
1621 tagcaccaat gatggcacca ataattaaaa cccataacca atgcataaag catcatcctc 
1681 tcttattttt cgagggatgt tttttagatt tgttatagtt ttgaatttct ttgaaaatgg 
1741 ctaataaaag ttgagatata gaaatgataa ttgataattt tgaccaaagt ccaaatcttt 
1801 tcatatgaaa tcctccatta tctaaattgc attcttctaa acctaattat aggcaattga 
1861 atatttgcat acaagttatt cgattgtgcc attcttttta aaataggaat ttttgatttg
1921 cctttttctc aattacagac gttaaagaat aattgattta gttattgcac ttacaaacaa 
1981 tttagtctac ggacttaaaa aggagttttg atgatgaaaa caaatttgca attttttgct 
2041 gatcctaacc cagaacctaa gccaaacgat tctaacccaa atcctgatgt gaccttttgg 
2101 gcgtggtgtg atagacggtt cttgcatcaa gacgaaaact tctttcattg tttgttgtag
2161 gaatttcaat ggtacctgaa tttttcgtct tggtgtcttt tttttaccat tttagctaca
2221 tggctaactt aattataaaa cgcgcgagaa taaatttttg cgagaaaaag gaattaaaaa 
2281 ccataataaa tttattctat tgaatttaaa tgactatcgt gttgctaagt taagatatcc
2341 agtaactcaa gctagtttga atgaaatgtt aatcaaactt ggtaatgaag ccgatattac 
2401 tgcgcctaaa gagatgcagt ggtcgatgta ttcattgcgg catacagttg caacaagact 
2461 tgggaaaata cagccgggat gagttatccg tgggcggcag cgcgtatggg acatacgtta 
2521 gaagagttca tgcagactta tgtacatgtt gatcagaatc gttcgttaga aatgcttgct 
2581 aagtgggtat gattgactaa atttctcaaa cacacacacg aaacacacag aactcaaaga 
2641 taattaaata aacgcgattg ctgcgggatt tcgaagctca actttgtatc ttgcagacgc 
2701 ataaacaaaa taattcttat ttgcaaaagc cgacaatagc gcgttgattg tcggtttttg 
2761 tttgtttaaa aaaggctgta tcagttaata tcaacacaca gcaaaattga aaaaagtcgc
2821 gcgttattac ggtggtacca taatttttac aggacaacta ttccttatgt aattcgatag 
2881 cattagcaga aagacaaatc acacgggcaa catagtggtc attatgattt agattccaat 
2941 gcttttacga taatagctgg ggctagcacc actgtagcgt ttaaatgcct tggagaaagt 
3001 gaagctatcc ttgtagccta ccatattggc aatgctttgc actgaattat tactagtgct 
3061 tagacgctgt ttagcatctt ctaagcgcaa tttagttagt aatttttgtg gcgaggtatt 
3121 agcgtgcgtt ttaaacaagg tgtagagata gcttcgtgat aaattaagat aatgacagag 
3181 gtccataatg gtacaaccag tactataatt ttcttgcaag taagaaacag ctaagttaaa 
3241 ttgttcagtt gactttactt ttatgttggt ggcgtcagcg ggataccagc gcaataagtc 
3301 atagaacaat cgatagatca aagaccctaa tagaacatca tttatcaatg aattagggat 
3361 gtgtaaaact ttataaagtt cagagagatc ggcgtaaata tggccatttt gtacttggcg
3421 taagtagcat ttctgagcta ataaagaacc agaaagcatt gcttcgatac ggataccggc 
3481 ggaaaaaccg atccaaaagt atttccacgg ttggtcattg tcagcttgat agaaacaagg 
3541 tgttccctta ggtagtataa aaatatcgcc ggcttttaat acggttgctg gatggttagc 
3601 tgcagcaaag gtacctttac cctcctggat gtaatggata acatagctgt ctctgacatt 
3661 attacctttg taagtgtaat tcggtaggca agcctcttgc ccgaaaaaga gtacattact 
3721 ttcgaagctt ttattcgcta aagttttata ttcaccattc atatgagttc tccatttact 
3781 taaaatatct tgctaaatat gtgaaaatgt ttttcaacta attaataaat agctacacta 
3841 tactgctatt taaaaacatg tcaaattgtt tactaataaa aataacaaaa aattgataaa 
3901 gattactatg ggcaacccaa tagacaggaa ttataatgag ctttgttaat gaaaacggat
3961 acaaaaatga tttgagggca ggtaattcat ttatgcagga agaacataat tataaatggg
4021 taggaggtcg cctaatctat ggttttggtg caaaaggtaa tgatgctttc tatagtatcc
4081 tctccggata tttaattatc ttcattactt cacacctttt tgataccggc aacaaagcac
4141 tagataatcg aatggtcagt ttggtgacct tgattattat ggtgttacga atcgttgagc
4201 tattcattga tccctttatc ggtaacgcaa ttgaccgaac taaaaactcg ccgggtcatt
4261 tccggccatg ggttgtagtc ggtgggaccg tctcttcaat tattctcttg ctattattta
4321 ctaatttagg cggcctctat gctaaaaacg caatgattta tttggtcgtt tttgcaattt
4381 tgtatattac gatggatatt ttctattcgt ttaaagacgt tggtttttgg tcaatgttac
4441 cttccttgac cactgattca cgagaacgcg aaaaaactgc tacttttgcc cgcttagggt
4501 caactattgg tggtggcttg gttggtgtgt tggtcatgcc ggcagttatc ttcttttctg
4561 ccaaggcgac tagtactggc gataatcgtg gttggttcat ttttgccttg atcatctgct
4621 tgatagcttt aatttcagcc tggggtgttg gcttaggtac acgtgaagtt gatagtgata
4681 ttcgtaaaaa taaacaagat acagttggtg tgatggagat ttttaaggca ctggctaaaa
4741 atgatcagtt gctatgggca gctttagcct atttattcta tggtgtgggc attaatattt
4801 taggttcact ggaagtctat tatttcacgt atattatggg taagccgaaa tcattttcta
4861 ttctatcgat cattaatatt ttcttaggtt tgattgctac atcgctattt ccagtattat
4921 cgaaaaaatt cagtcgtaaa ggtgtttttg ctggctgttt agtgttcatg ctaggtggta
4981 ttgcgatttt taccattgct ggcagtaatt tatggctggt tctattagcg gcaactatgt
5041 ttgggttccc gcaacagatg gtctttttag tagttttgat ggtaatcacg gattctgttg
5101 aatacggtca attgaagctg gggcatcgtg atgagtcgct ggctttatca gtgcggccgt
5161 tgatcgataa atttggtggc gctatctcta acggggttgt tggccaaatt gccattattt
5221 ctggtatgac gactggtgcg actgcttctt ctatcactgc tgcaggacaa ttacatttta
5281 aattaacgat gtttgccttt cccgcgttaa tgctacttat cgctattggc attttttcaa
5341 agcaaatatt cttaaccgaa gaaaaacacg ccgaaattgt ggctgaactt gaaagaacct
5401 ggagaacaaa gttcgataat accactgatc aggtagcaga aaaagtagtt acatcacttg
5461 atttagctac accaatcgct gggcaagtga ttccacttgc ccaagtcaat gatccaactt
5521 ttgcggctgg aacgttaggt gacggatttg ctattaaacc tagtgatggc cgaatattag
5581 ctccatttga tgcaacggta cgtcaagtat ttaccacacg acatgcagtt ggcttagtcg
5641 gtgataatgg gatcgtctta ttgatccata ttggtttggg aactgttaaa cttagaggaa
5701 cgggatttat ttcttatgtt gaggaggggc agcatgtaca acaaggggat gaattacttg
5761 agttttggga tccaacgatc aaacaagctg ggttagatga tacggttatt atgacagtga
5821 ctaattcaac cgaattcact atgatggatt ggttagtcaa gccaggtcaa gccgttaaag
5881 caactgataa tattttacag ttgcatacta aagcataatg tgccactggt tatcatttaa
5941 ttttaaaatt agtaattccc cgaggaggaa ataaaatgtc attaattacg gttgatcaag
6001 cgaatcgcgt ttttcatttg cataatcaaa ctctttctta tatctttgca gttgagcagg
6061 gcggtacttt aagtcattta tattttggtg ggcatgtgga ccattaccac ggtgaattgc
6121 gctatccacg agtcgatcgt ggtttttccg gtaatttacc gggatcgact gatcgcactt
6181 tttcacgtga tactttgccg aaagaataca gtacggctgg tgaaatggat tatcacttac
6241 cagcagcaat cgttcgtcat actgatgggg ctaatgcatt atacttagtt tatcagggat
6301 atcggatcga agcgggtaaa ccaaagttaa gcgggctacc ggcagctttt gtggaggatg
6361 aaacagaggc agaaactttg actatcgtat tagttgatca agtaagccaa gtcgagtttg
6421 atctacaata tacgatctac cgtgatcgac cagtggtcac gcgctcggtg caagtctgta
6481 atcaaggtga tcatgcagtt aatttagaaa aagttgcttc aatgcaaatc gattttacgg
6541 atcgtcagtt cgaaacaatt acattacctg gagcgcatgc caatgaacgc catccagagc
6601 ggggttcgat caattatgga atacagactt ttggtagttt gcggggtact tccagtcatc
6661 aaatgaatcc atttttggcc ttggttgatc acactacgac agagtttagc ggtgatgcat
6721 atggttttaa cttggtctat tctggcaatc acgcatttga actggaaaag gatcaacttg
6781 accaacttca tctaatggtc ggtattaata gttataattt caattggcaa cttaaagctg
6841 gcgcgacttt ccaaacacca gaagtattaa tggtctatac aaataaaggt ttgaacgcga
6901 tgagccaagc ctaccaccat ttgatccgtg aacgagtagt acgtagtgag tttaaaaatc
6961 aagagcgtcc gattgtagtt aacaactggg aagcaacatt ctttgatttt aacgaagcaa
7021 aattgaagcc gatcgtggat gaagccaagc agttaggcat tgaaatgttt gttttagatg
7081 atggttggtt tggacatcgt gatgatgata attcttcatt aggtgactgg caagttgatc
7141 atagaaaatt tccgcagggc ttaaatcatt ttgtgaaata tgtgcatgaa caaggcctta
7201 aatttggtat ttggttagaa ccagaaatga tttcatatga ttcaaagcta tatcagcaac
7261 atccagatta tttaatgcaa gttcctggcc gatcaccaag tccatcacgt aatcagtata
7321 ttttagatct tggtcggcaa gccgtacgaa ataatatttt tgaccagctt gatcaacttc
7381 taaaatcgaa gcaaattgat tatattaaat gggatatgaa tcgccacctt tcagatattt
7441 attccgttgc tttaccacca gaacgccagg gtgaggtcta tcatcggtat gttttaggtc
7501 tgtatgaatt attggaacgg ttgacgacag cttatccgca tatcttattt gaaggttgct
7561 caggtggtgg cggtcgattt gatgctggga tggcttacta tatgccacag atttgggcta
7621 gtgataacac tgatgcagtt gcgcgattga cgattcaata tgggactagt ttagcctatc
7681 caatttcgtt agctaccgcc catgtctcag tgagccctaa tcagcaaaca ggtcgtgaaa
7741 cgtcaatgtc gactagaagt gcagtagcgg caagcggtgt cttaggttat gaattggatc
7801 taacacaact cagttcagcg gataaacaaa tagtccaaaa gcaagttgta cagtataaac
7861 agattagacc actgattcaa tttggtgaat tttatcgctt aaaatcacca attactagca
7921 atcaagcagc atggatgttt gtttcacctc aacaagatga agcgattgtc atggtattta
7981 atctgacatc ctatgcccaa cccagtttga caaaaaccaa attagtcggt ttgaatccca
8041 agctgaatta ccaaaatatt gctactaagg cgatatttgg tggtgatgaa ttgatgcaac
8101 ttggcttcta tgatccagta gtctaccaag attacacaac aaaggtctat cattttaaag
8161 cagtcactga gaactaactt ttacgtaaat tgcaatagta agttaaccag tgccggtaaa
8221 gtacacagac gaaagggctc atgggagttc caggtggatc aggcggcgcg gcctttaccc
8281 gcacgcgcta attcctttcg aaatacttca gctggtgtac agaacctaag taaacggcgc
8341 gggagatggt tgagccgaga ttcagcttga cgaaccagac tgggtgtagc tagatcgagt
8401 gattacccct taggaaagta acgtctgagt gattgtggac ctcattagtg gcgcgttcgg
8461 ccgagctata agggtgggta taataagtct cggtaatacc ctcaagcgca gccgttagcg
8521 tggtgaactc tgttccgtta tccgccgtaa ctgtccgcat aacgttaccg aattcagccg
8581 tagtattttg catgcataag ccacagatag cgcgttttac gttcggtaaa cgttaatagg
8641 acactctcgc taccagcacg tttgccaatg acagtatcga tctcaaaatg accaaactcg
8701 tgacgcattt ctactcgctt tggccgttct tcaatactac gaccgaatag gcgctgatgt
8761 tggtgcgagt ggtgctgggt catccgacgt tttgtttttt ccagcagatc aaggttacgg
8821 atctccaaca gttgcgcatc gatatagttg tacaaagtct tcgtacagac cattttggcc
8881 ggcttaaata accggtgctt acgggcataa ccaacggcag cgtccggcga ccagccctca
8941 tggcagaact tatcatcaaa gtaggctaag aaggccgtca cttggctaaa ttttaatggg
9001 cgtccgcagt tttgccgctt tatttcatag cgattctgag cggcttcagg actgtaaatt
9061 tcaaagtagt gtttctgatt attaacgcgt ttaacttgcg tgactcgacc acgcttgagc
9121 tcattattga cggtctggtg gcagacgcct aatcgaatcg ctatttgccg tactgagcat
9181 ccttcaatgt gccaagcggc gatttaaccg cgttcgattt ccgataaatg gtgacctttt
9241 aggcgtggtg tgatagactg ttcttgcatc aagacgaaaa cttctttcat tgtttgttgt
9301 aggaacttca atgatacctg aatttttcgt cttggtgtct ttttttacca ttttggctgc
9361 gtggctaact taattataaa acgcgcggat aattttgata attggtagat tgcaaattta
9421 atccgaattt ttggacaaat ttgtcagcat aacctgatac ggtgtttgct agtcgagtat
9481 tttaagcggt cgctggttaa tttggagtaa cgtcgtcgtt aaatcttgag tactaatgtg
9541 ctcaaaacga gtccctttag gataaaaaca acgtaaattc cgattaaagc gttcattact
9601 accacgtcag ctggagtata agcatggcag taataggtct taataccata ttgtgattca
9661 agtgatacta gaccgctaaa ctcagtgcca cggtccaccg tgaagctgtg caccggacca
9721 ttaaaagtgg ttaggaactt agttagggct tcattaacag tcgctgtcgt ccgatctttt
9781 aaccggtagg cccaaaggaa ccgtgatttg cgatcgatta aagttaataa aactgcctta
9841 ctatgcccac gaggaccaac gactgtatct agttcaaaat cgccgatgcg attacgttga
9901 ttaaacatca tgggacgctg ttcaattgat cgccccaaag attgattata tttggatcgt
9961 tggtcaacgt tacgccgttg gcgtacgcca tgttcaggta gatcattcaa ggagaaacca
10021 attctcccct gatttagcca attataaata gatttagtag ctagtttaaa ttcgtgagca
10081 atcattcctg gtgaccagct tagacgtaaa tggttgagaa ttttttgctt taactcatcg
10141 ctcagcttag ttttccgacc acattgtgat cgcttgtatt cggcatctgt ttgtgctaat
10201 tcagcctgat aaggttgaca tcgagataat tcataagaaa ttgttgacgg tgatcggttc
10261 agccgaacgc ccatttggat attggacagc cctagttcac aaaaggtttc gattttaatt
10321 cgttcggaat aggttatact agacaaaaga tcagctccta aaagatgggt ttgtggtaaa
10381 caccatttta aggaagctga tctttttgtc cgaacagcgt cggattaatt ttacaatcta
10441 ccatgtacgt gtcgtatatt ccgcaaatca tttcgaattt ctcaggtgac ccagtatcgc
10501 cgctacaacc actcgtggca atgattaatg ggatattatg gactggttac ggctggttca
10561 agacttataa ggattggccc gttattattt caaatgttcc cggggtgatt tttggattta
10621 tcactgtttt aaccgtatat atgcattaaa cgaatgacaa attgactgta aactaaaagt
10681 ccgctaatac gtttaaacac gtatcggcgg acttttttgc aaggtgttaa gcgttcttta
10741 aagcagcctg agctaacaat aggccaccgg caatgccaga attgttaccc agtccagcgg
10801 gaacgatata gtcatccagt ggtgggactt cttcgtagcc atttagatag gcggcaaact
10861 tttcacgaat catgggaaag agctgccgtt gattcataac accaccgtta agaataattt
10921 tatcaggtgc gaaggcgacg gtgaggttga cacatgcctg tgcgatgtag tcagtgatga
10981 ttggccagac cggatcgtca acgggaatat ctttcccagc tttccccgtt cgttttccaa
11041 ctgctggccc agcagctaaa ccttctaaac agatgtcatg gtaaggacag ttggacttga
11101 aatcgtcccc aggtaaccgg ttgagtcgca tgtgaccaag ctcagtatga gtgcgaccat
11161 taaaaatgtg gttctgacta atgacgccag cgccaacacc ggtaccaatc gtcatgtaga
11221 tgctgttagg gacgtcctta gcgattccaa tcattgattc gccataagcg gcttcgttaa
11281 catcagtggt ccagtagagt gggatgttga attgactttt aagatggcct aaaaagtcga
11341 agtcacccca gccgggtttg ggagtcgtgg taatgtaacc atatttcgga tcatggggat
11401 taacgccgat ggggccaaat gaaccgatac caatcgcgtc aacgggatga gtcgtaaaat
11461 agtcgtctac cgcagtcatg gtttcaacgg gggttgtcgt ggggatcgaa attcggtcgc
11521 taacttgccc gttttcagcg cccgtggcac aaacaaattt tgtaccacca gcttcaattg
11581 cacctaaaag catagtgtgc ctcctgtagt taaaaatgta gtcttattta ataataatat
11641 attcttaatt cgaaattttg ttaacgatta cattaatttt gacataaata aattagctga
11701 ctagtgaaat cagcgttggt gggtttaagt ggaatcccca tcgtgtttaa ctgatgcccc



11761 gtatagcgat tacccgatga atcgctatac gctaattcag catcgagata gtgtaatggt 
11821 aatgggtgct gagttttgac acgagaatta agcccattta agtaacagca aattgcctgc 
11881 tttttatctg ccgttactat caaccacgca acaacgtgtc ttgttgttgg tagacgataa 
11941 aagtgggcct tgataaaagt agatttcagt tgctgatatt taattaaggc tcctcgaaga 
12001 agttgctgtt cactaggagc aagttgctta ggattaagtt ccaatcctag ttgcccgatg 
12061 caggctaatt gtaatcgagt ttcaaatggt gttattcggc cattttgcgc attgggtgat 
12121 gcggttatat ggttgctgaa tatccgtggt ggaaatagat aactaaatcc gttctcgatg 
12181 gttgctcggt cgactggatc agtcaaatcg cttatccacg tctgattggt gtatggtagc 
12241 ataccaaaat caaaccggcc accaccagct gaacaatttt cgataatcag cttggggcag 
12301 gcgcgcttca agcgtgtgag gatatcataa agtccacaga catagcgatg atacagttcc 
12361 ccctgttgcg cagccggtaa gtgtgtgctg ccaacctgtg taaggtggcg attcatatcc 
12421 catttaagat aatcgagttg gttattttga accaagttcg tcagagtagt gattaagtgg 
12481 tcacggacag cggcctgcga aaggtctaag actagctggt ggcgagctgt gatgggtgtc 
12541 cgatcaacat attgtaatac ccagtcagga tgttgttggt agagctgact gttagtggtg 
12601 atcatctctg gttccaccca gagaccgaat ttcattcgat gatgatgggc ttgttgggca
12661 agggggttca gaccattagg aaacttaatg gggtcaacaa accagtcacc taattgccca 
12721 ttttcgccgt tacggttgac aaaccagccg tcgtctaaga caagcatttg taagcctaac 
12781 tggtgggcgt gttcgattaa gtgttgaact ttagactccg aaacggcaaa tgtcagtgtt 
12841 tcccaagtgt taagcacact tggaatgtta gtctgggagg gcatcagctg atagctaaag 
12901 tcgtgaaata cttgcgacat gccgttaaaa ccagtatttg tccaagttag cacggcttca 
12961 ggagtctgaa aactgctatt tggtttgagc tgccattcga atgtgtcggg ctcaagaccg 
13021 attgttaggc ggctatgttg atattggtca acttcgacag tactgtcgaa gttaccactc 
13081 catgctagtg cacaaccaat aacagtccca gctaattctg tcgtgttagg ttctgctaga 
13141 gccatgaacg gttggtgttg tggtccacta gtgccgcgca gactgcgaac cgtttgtaac 
13201 cctgggtgta aacggttaaa cgatggatta gcttcatgtg catgggtacc gcttaatgtc 
13261 agcgcagtat attggtcagt tggtagatct agttgagcgc tggatagggc tgttacttgc 
13321 aaattagtag taccgtgatg gcgcagagtt gtgcttcgca agataagcgg ctgattttca 
13381 aaaatagtgt agttaagatc catctgtaac ttggtcactg catcggtcaa atgaatgact 
13441 agggttgtta caggcgtgtg tttgctaacg gtgggtggca attggcgcga gttgattggc 
13501 tgatcattca cagaaaaacc agtatattct aaaattggca gtaattgatt gttagcatcc 
13561 ttaataacgt aagctggctg gcgatagtca ccactgccaa tagtggaata ttcgagtggc 
13621 agtgaagtca cggaatatgg aaagcgttct gttgggtcat tagcaaaagc atggctaccg 
13681 gatggcatcg gctcaaagta cggttgtttt gaaaagtaac gtccaaaata tcgatgcact 
13741 gggtaacgat gagctaaaat ttgaataaca tagctaattt ggtcattatg aagattgatt 
13801 aagccggttt tggggtcata ttcaacaggc atgacgtcct cctaataaag tttagacagt 
13861 tgtagcagca actgagctag ctgccggagc agtcaacgca acgagctgtt caccagcttg 
13921 gacattcgtg actttgagtc gttcaacatt agcgtaattg gccgtatttg tgacgattag 
13981 catgacggtt ggatcatagt tagcggcttt taaggcagcg atgtcaaacg ttccgagtag 
14041 atcaccttga tgaacggtat cgcccttctg aacgttagta gtgaagtgtt caccatttaa 
14101 attgaccgta tctaatccta agtgaatcag aatttcggca ccagcagttg ttttgatgcc 
14161 ataagcatga tggctatcat aagtgacagt gatgaccccg tcagctggtg cgacgacctg 
14221 atcagagctt ggcacgattg cggcgccttt ccccatgatt tcggctgaga aaacttggtc 
14281 gttaacttgt ttgagacttt cgctagcccc cgttaccggt gcactgataa tctcatcatt
14341 aacattgatg acggtactcg ttggtgctgg tgattttact tgatcacgat cgtcaccgag 
14401 tgttcgtttg gcatagataa aagttggaat aaatgcgacg acgaacgaga tgacagcact 
14461 gagcatgaat gctggaatcg actttgaagc aatcgaaatg aatccgatga ctgatgcggg 
14521 acccatcgcc acggacagaa cgtggaatag tcctaagaaa gctgaagcaa ttcctgaggc 
14581 aatcgctgca aagacaaatg gaaatttcat cttgaggttg accccgaaaa tagcaggttc 
14641 cgtaattccg agtaacgctg agaccccggc tgaagaagtc agggcttttt gcttttggct 
14701 cttagtggca aagaaaatag ctaaagtggc ggcaccttgc ccaatgttgg ccatcgaggc 
14761 gactgggaag ataaaggaac caccagtttt ggcaacattg gccaacaact gggtttcgat 
14821 tgctgggaaa gtttgatgga gaccagtaat aacaattgca gaatataata aaccaaagat 
14881 tcccatacca atccagccgg tgctgttata taagcctact aaaccgttag ttaatgcatc 
14941 gctcaccgtt cgtaagacgg gaccaacgat tgtaaaagtt aggaaaccag taatcacaat 
15001 ggcaaacatc ggggtaaacg tgaagtcaaa tgccccttta atgtgtttat ggaagaattt 
15061 ttcaagggta gctagaatga aggcgacgcc taagactggg agcacttggc cttgatagcc 
15121 ggcttgtgca acgtgtaacc caaagacgtt ccagtaaacc atcttgccgg ctgccatggt 
15181 cgttgcaacg ctatagccgt taactagtga tggtaagacc atgatcatgc ccatcgtagc 
15241 acccagatag gggttgccac cgaagcgctt cgttgctgaa aatcctaaga ggattggcaa 
15301 gaaagtaaac ggcgcactag ccatcgcgtt aatcatttcg gcgatacctt tgaggccagg 
15361 gtaaacttca acaactgact tcgccataaa aagatgctcg gcagtcaaaa cgttgtttag 
15421 tgccattaat agacccccag caactagtgc agggacgatt ggaataaaaa tatcagataa 
15481 gactttgagg aagtccatta atggattttt attttgaccc gcagcggcaa ctgccttgat 
15541 gtcatcgggt gtcacttctt taagacctgt tttaacgatt aaggcgtcat agactttatc 
15601 gacatcgcca gggccaataa tgatttggta ctggccgtta gtttcgaagg tccccttaac 



15661 gtccgcgtcg tcatctaagg cttgttgatc aatcttggat tcatccttga tgaccaaacg 
15721 gaggcgtgtc gcacagtggg cggcggcttg aatgttgttt ttaccaattg cattaagtac 
15781 gcggtcggca acttcttgat gattcataat aattcactcc ctgagtctaa aattattttt 
15841 aatttaataa tgcaagcgct ttacagaccc aataatagca attttggaaa atatgtcaag 
15901 cgtttgatac aaaattggat tgcttgataa tctggttaaa aagccgaatt tgaaacgctg 
15961 acatgaaaat ataatataaa aatgtcaaac gattgacata tcggtgtaag cggttttata 
16021 atagtgacaa ttaagtcgaa cgttaaaaag gagctcatcg tcatgatatg gaatcgtaaa 
16081 acccgttata ccccttatga acagtggcca gcaactaagc taccacagct cgttgcccag 
16141 gcacgccagt ctaagtggcg gatgcaacat catattcagc ctacatcagg tttgttaaat 
16201 gaccccaatg gcttttcata ctttgatggt cagtggcacc tgttctacca agtctttccg 
16261 ttcggtccgg tacacggact gaagtcatgg caacacgtga cctctaagaa cttggtggat 
16321 tggcatgatg agggcttagc gattcgacca gatacgccgt acgactcaca cggcgcgtac 
16381 acgggtacag cactaccaat tgatgatcag ttatttatta tgtatacggg taatgtccgt 
16441 actgccgact ggcaacgtga atcgtatcaa ttaggtgcct ggatggacac ggataatcac 
16501 attaaaaagc tgtcccggcc actaattgcg catgctccgg ctgggtacac gagttcgttt 
16561 cgtgatccgg acttgatccg gaatgaccac ggctactacg cgctgatcgg tgcgcagacg 
16621 actactgaga tcggtgctat cttggtttat ttctcaaagg atctgaccac ctggacgtgt 
16681 caaggcgagt tgaacgttcc cgcaaatgcg cggggttata tgattgagtg tccaaaatct 
16741 ggtctggatc gatcaacagc ccgtcttatt gttctgtccc aagggttatc gcaagcaacg 
16801 atcccgtatc agaatattta tccgaacatg tacttagttg ccgatcaatt gaacctagct 
16861 caggcacaat tcactgaacc ccacgccctg acgcaattgg atgacggctt tgacgtgtat 
16921 gccacgcagg caatcaatgc gccggatggc cgcgcgttag ccgtcagctg gattgggtta 
16981 cctgaaattt cctatccaac tgatcgtgaa aattgggccc attgtttgag cttagttaag 
17041 gaactaacgc ttaaagatgg gcacctctat caaaatccag tggcagcggt ggatgatttg 
17101 cggacgacgg cgcatgacct agtctttgag caacaacggg cgacggtggc cgctttgaat 
17161 ggcagctttg aactcttact gacggttccc gctgataaaa cggtcacggt taacattgct 
17221 gatcagcagg aaagtggcca gctgcaggtg actgttgatg ccaaccatgg tcaggttatg 
17281 attgatcggc ggcatacggg gaattcgttt gctgaagatt atggtcagac tcgtcaagtt 
17341 gagttaacgg cccataagac gataaaaatc cggctaatca tagatgtgtc agtctttgaa 
17401 tgctatattg ataacggcta ttccgtgatg accggccgtt tcttcttaaa cgctaccccg 
17461 agtcgtctta acgttcaagg tgacactacg gcagttaccg gtaaagtctg ggaatggcgc 
17521 caatcagagc atactggagt cgataacaat gaaaccaaaa ttaaatgatg ttgcgaagct 
17581 agcgggggtc tcagcgacaa ccgtttctcg ggttattaat aaccatggct acctgagtag 
17641 tcagactaaa gaaaaagttt tcgcggcgat gcgagaactg cattatcagc cgaataatat 
17701 ggcacgctca ctgcaaggaa agaatacgcg cttaattggc gtcatctttt cagatattag 
17761 taatccgttc tttggtgaac tggtgtcccg gattgaaaaa atcttatttg ctaaaaatta 
17821 taaggtgatt ttatgtaata gtgcggatga tccgcaaaaa gaacgcgact acttacaaat 
17881 gttgatggct aatcaggtcg acgggattat tgctggtgcc cacaatttag gtattgaaga 
17941 atatcaacaa tatggtcttc cgattatttc atttgaccgg tacttatccg acaatattcc 
18001 catcgttagt tcggataatt accagggggg ctggctagca acccagacat tgcatcaggc 
18061 gggggcgact aacgtggcta tttttactgg taagtcacac gctggttcgc caacgaatgg 
18121 tcgccgtgaa ggatatgaag cctacttgac tgcccaacag ctaacacccc atgtccatga 
18181 gttaccgttt gaattgaccc ctgcgctgaa gatgatggaa attaagacga ttatgacaca 
18241 acatcagtat gatggtattt tttgtagtga tgatttggcc gctttactag tgctcaacgt 
18301 cgcgcaacaa ctgtcattaa cggttcctga gcaattacgc gttgttggtt atgatggcac 
18361 ggcgctcata cgtgattatc attctgaatt aacaacagta gaacaaccgt tagcggatat 
18421 tagcacgtta ctggtttctc tactattgca gcggattgag gatgcgaatt gtacactgga 
18481 atcaaaatat acgttaccgg ttaagctcat caagggcttt accgcgtaac taaggagtta 
18541 atcgtaatta tggcagcaac aatcaagtgg tggcaacaag ccgttgtcta tcaggtctat 
18601 ccacggagtt ttcaagatac caatcatgat ggtattggtg acttaaaggg gatcacggcc 
18661 cacctagact acttaaagca attaggaatc gacgtcattt ggctcaatcc catttatcgt 
18721 tcacccaacg atgataatgg ttatgatatt agtgattatc aacaaattgc ggccgacttt 
18781 ggcactatgg ctgattttga tgaactattg caagccgccc atgaccgggg cttaaagatc 
18841 atcatggact tggttgttaa ccatacctct gatgagcatc ctaggttcaa acgcagtcgc 
18901 caggaccgca ccaatcaata tcgtgatttc tacttttggc gctcaggtaa cggcaaaaag 
18961 gcgcccaata attgggaggc ggcttttggt ggttcagcat ggcaatatga tgagcagacc 
19021 cagcagtatt atttacacac gttttcaaca aaacaacctg atttaaactg ggagaatccg 
19081 accttacgag agtcagttta tacgatgatg acctggtggc tgaacaaggg tgttgacggt 
19141 tttcgaatgg acgtcattaa ccaaatctct aagttacccg gattaccaga tggtccactg 
19201 aaaccgcaca gtcagtttgg cgatgctcga gtaacgaacg gaccgcgagt tcatgaattc 
19261 ttgcaagaaa tgaaccaaga agtattgtca cagtttgata tcatgacggt tggtgagacc 
19321 cacggggtga caccagcgga tgcgctgaag tatgcgggcg ctgatcagca cgaattagat 
19381 atggtctttg aatttcaaca tctgcgactt gataatagtc agcatggtct tggtaagtgg 
19441 agtacgcgta agacaccgtt agtggcgcta aagaaagtca ttagtgactg gcaagtcgga 
19501 cttgaagggc gggcttggaa cagcctgttt tggaacaatc atgatacgcc acgagcggtc 
19561 tcacgttttg gtgatgaccg accagcatat cgggtacgtt cggctaaaat gctcgcaacg
19621 tgtttacacc tcttacaggg aaccccatat atttaccaag gtgaggaact ggggatgact 
19681 gacgctcatt ttactgaact agccagttac cgagatattg aatcgctcag tgcctatcgt 
19741 gatttagtga ctgagcgcca gctgctatca ccggcagata tgatggcccg attagcggcg 
19801 gcttctcggg acaactcgcg aacgccgatg caatgggaca cggaggttaa cgcgggtttt 
19861 agtgacgcag cgccatggct cacggtcaat cccaactatc gtcaaatcaa tgctgccgca 
19921 gctttggctg atccggattc agtgtggtac tattatcagc acttgattca attacgacat 
19981 caatacccct cagtgacgtt aggttcgttt gagctcttgt gggccgatga tccgcagtat 
20041 tcttatatgc acggcaatgg gaaggcggac ctggctagtt tgttgcaatt tcacagccga 
20101 gacactgtcc cgaccactgg atcaatatct gaccccacgg ccaaatgctt gattagtaat 
20161 tatggcgagc agcagccaaa taaattacgg ccttatgagg cttgggtcta ccaactcgca 
20221 taggtattca ggacttggct aattgaggtg gtctttgaaa tatacccaat cgttacgatt 
20281 agtgtgaatt gggttgttca aaattctgcg atggtcttaa taagcgacca cctgactaaa 
20341 aaaactgctt tcaaaaggat aacttttgaa agcagttttt ttagccgcgg cgaccacttg 
20401 gcccacgtga ggctcatcta gtcgttggtg accttgaatg aagcccgtga ggttaaacgg 
20461 tagttgttca tcgatcgtcg tgaccaattg accagtatct agcaacttga atagtaattg 
20521 taaggcggct tgatcagaaa tcacttcggt cggttggatt gcttggaaac gaattgattt 
20581 actgttggct ggcaagtgac gggcgaatga agcgatggtc gcatcaaatt tggccattgc 
20641 taagtcatct ttgctccaac gccatcgata gcgacattga ttacgacgtc accgcggtct 
20701 gggtctggat gtaagctgat tcctgagaca acttttaaga gagggtataa tgaataaatc 
20761 gtctctcaaa agaaaggaat ttttacctgc caactcgtta cgacaaagaa ttcaaacaaa 
20821 acattatcaa cctatataag caaggcgaat cagctgccca actggccaga gaatatggca 
20881 ttggctattc aacagttcat aagtggatcc agggccaggc caaaactcaa tccggtaaat 
20941 cgccagacga aatcaaagcg atggaaaagc gacgggcttc gctgtctgag gagaacgaaa 
21001 tcctaaaaaa gccctgggct tccttgcgca gaagtaacca atatctttga ttacgttcac 
21061 caagaagcca tcaccaccag gtaaccaaga tgtgccgaat cctcggtgtt tccagagctc 
21121 agtattatcg ttatcgatcc cccaaacctt caaaacgccg ggccgaagat gcggacttga 
21181 aacaacggat tctgcggatc tttgcggaat ttaagcagcg atacggtgtt atgaagatcc 
21241 accatgaatt gaatctggaa cttcaaccac tgcagcttcg gtgcagtcca cgacggattt 
21301 cccggctcat gaaggaactg gatatccact ccgttaccgt caataagtgg aaagcggctt 
21361 cggcttccaa aaccaaggtt gaacagcgtc ccaacttgct taagcaggat ttctcgacca 
21421 ctggtttaaa tcaaaaatgg accgctgata tgacctatat tcaaacgaag cgtaatggct 
21481 ggtgttactt atcaaccatc atggacctgc actcacgacg gattatcggc tattcgttct 
21541 caaaaaagat ggctactgat ttagtcttaa agaccctgga aagcgcggtt aaaaatcgaa 
21601 ccattactgg tggacctgat tatccatacg gatttaggat cacaggatac cagcgattga 
21661 ttacaatcaa cgtttaactg agctacatat ccgccactca tacagccgta aggttatccg 
21721 tatgataatg cgccaataga atcctttcac gcttccctca aaaaggaatg tgtttatcca 
21781 gtgccggtct ttgaagatta tgaaactgtc gctgccgtcc tttttgaata tgtgcatg 
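
A /translation qualifier can be cross-checked against the printed ORIGIN block by translating the CDS interval directly. The following is a minimal Python sketch, not part of the database record. It assumes 1-based inclusive coordinates as used in the feature tables, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code. The helper names parse_origin, base_count and translate are hypothetical, and the two ORIGIN lines are copied from the start of the ECORAF entry (M27273) in this listing, whose first CDS begins at base 70.

```python
import re
from collections import Counter
from itertools import product

# Codon table in NCBI order (bases t, c, a, g). Table 11 uses the same
# codon-to-amino-acid assignments as the standard code.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def parse_origin(lines):
    """Join ORIGIN lines into one sequence, dropping position numbers and spaces."""
    return "".join(re.sub(r"[^acgtn]", "", line.lower()) for line in lines)


def base_count(seq):
    """Return counts of a, c, g, t for comparison with the BASE COUNT line."""
    counts = Counter(seq)
    return {base: counts.get(base, 0) for base in "acgt"}


def translate(seq, start, end):
    """Translate seq[start..end] (1-based, inclusive), stopping at a stop codon."""
    cds = seq[start - 1:end]
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unreadable codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two ORIGIN lines of the ECORAF entry (bases 1-120).
origin_lines = [
    "1 aaacgttttg gttgatgttc gaaacgtttc ggatcaacag taagacatac ctgaaagcgg",
    "61 agatgtctta tgatttcaaa gtactgcaga ctgagcagtc ctcgctctga tttaattatt",
]
seq = parse_origin(origin_lines)
print(base_count(seq))
print(translate(seq, 70, 120))  # MISKYCRLSSPRSDLII
```

The printed peptide matches the first 17 residues of the alpha-D-galactosidase /translation in the ECORAF entry. Running the same check over a full CDS interval is one way to spot scanning errors in either the sequence or the translation.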
LOCUS ECORAF 5284 bp DNA BCT 15-JUN-1990
DEFINITION E.coli rafA, rafB, and rafD genes encoding alpha-D-galactosidase,
raf-permease, and raf-invertase, complete cds.
ACCESSION M27273
NID g147504
KEYWORDS alpha-D-galactosidase; raf-invertase; raf-permease; rafA gene; rafB
gene; rafD gene.
SOURCE E.coli (strain K-12) DNA.
ORGANISM Escherichia coli
Eubacteria; Proteobacteria; gamma subdivision; Enterobacteriaceae;
Escherichia.
REFERENCE 1 (bases 1 to 5284)
AUTHORS Aslanidis,C., Schmid,K. and Schmitt,R.
TITLE Nucleotide sequences and operon structure of plasmid-borne genes
mediating uptake and utilization of raffinose in Escherichia coli
JOURNAL J. Bacteriol. 171, 6753-6763 (1989)
MEDLINE 90078124
COMMENT Draft entry and computer readable copy of sequence [1] kindly
provided by R.Schmitt, 01-SEP-1989.
FEATURES Location/Qualifiers
source 1..5284
/organism="Escherichia coli"
/db_xref="taxon:562"
gnal 12..17
gnal 36..41
RBS 58..63
/note="S.D. sequence (ribosome binding site)"
CDS 70..2196
/note="alpha-D-galactosidase"
/codon_start=1
/transl_table=11
/db_xref="PID:g147505"
/translation="MISKYCRLSSPRSDLIIKTHPHAEIIWWGSALKHFSPDDCASLE 
RPVANGRLDIDTPLTLIAENALGLFSSPGLEGHRNGLDASPVFYTVDVEHTENTLRLT 
SEDSVAGLRLVSELVMTPSGILKVRHALTNLREGDWQINRFAITLPVAERAEEVMAFH 
GRWTREFQPHRVRLTHDAFVLENRRGRTSHEHFPALIVGTPGFSEQQGEVWAVHLGWS 
GNHRMRCEAKTDGRRYVQAEALWMPGEKALRKNETLYTPWLYACHSADGLNGMSQQYH 
RFLRDEIIRFPEQKLRPVHLNTWEGIYFNHNPDYIMQMAERAAALGVERFIIDDGWFK 
GRNDDRAALGDWYTDEQKYPNGLMPVINHVKSLGMEFGIWVEPEMINPDSDLFRLHPD
WILSMPGYSQPTGRYQYVLNLNIPEAFDYIYKRFLWLLGEHPVDYVKWDMNRELVQAG 
HEGRAAADAQTRQFYRLLDLLRERFPHVEFESCASGGGRIDFEVLKRTHRFWASDNND 
ALERCTIQRGMSYFFPPEVMGAHIGHRRCHATFRQHSIAFRGLTALFGHMGLELDPVA 
ADAKESDGYRRYALLYKEWRQLIHTGVLWRVDMPDSSIQVQGVVSPDQSQALFMISQL
AMPDYTLPGILRFPGLAAEVRYRLRVIDHPEIQLVGEGGHTMRRLPAWMNQPLEASGE 
WLAKGGIQLPVLDPESAILIALERAV" 

RBS 2244..2249
/note="S.D. sequence (ribosome binding site)"
CDS 2259..3536
/note="raf-permease"
/codon_start=1
/transl_table=11
/db_xref="PID:g147506"
/translation="MNSASTHKNTDFWIFGLFFFLYFFIMATCFPFLPVWLSDVVGLS
KTDTGIVFSCLSLFAISFQPLLGVISDRLGLKKNLIWSISLLLVFFAPFFLYVFAPLL 
HLNIWAGALTGGVFIGFVFSAGAGAIEAYIERVSRSSGFEYGKARMFGCLGWALCATM 
AGILFNVDPSLVFWMGSGGALLLLLLLYLARPSTSQTAMVMNALGANSSLISTRMVFS 
LFRMRQMWMFVLYTIGVACVYDVFDQQFAIFFRSFFDTPQAGIKAFGFATTAGEICNA 
IIMFCTPWIINRIGAKNTLLVAGGIMTIRITGSAFATTMTEWILKMLHALEVPFLLV 
GAFKYITGVFDTRLSATVYLIGFQFSKQLAAILLSTFAGHLYDRMGFQNTYFVLGMIV
LTVTVISAFTLSSSPGIVHPSVEKAPVAHSEIN"
RBS 3523..3528
/note="S.D. sequence (ribosome binding site)"
CDS 3536..4966
/note="raf-invertase"
/codon_start=1
/transl_table=11
/db_xref="PID:g147507"

/translation="MKQRLSLAQSALEKLSARRGNTWYPIFHLAPPAGWMNDPNGLIY 
FNGRYHAFFQHHPASAYQGPMHWGHATSTDMLHWQHELVALAPGDKYDRDGCFSGSAV 
DDDGVLSLIYTGHICLEDRGNDSIIREVQCLATSHDGIRFEKQGCVLTPPEGIMHFRD 
PKVWHEDGSWWMVIGARDASDNGQVLLYRGTSLRDWHLEHVLAHSAAGESYMWECPDF 
FRCGNFHWLMFSPQGMNPSGYRFRNLFQSGVLAGNWKPGSVFALKGVFEELDYGHDFY 
APQSMLAEDGRRIIMAWMNMWDSPVPTRSEAWAGCLTLPREVFERDGRLCQRPVREVE
SLRRKCQPLSPVRLHGVQLLTENVQAAELLVTWHTVDSHAEHYGIRLGEGLRFYVDNQ 
AGRLILWRYYPEEGLDGYRSVELPDTEYLTLRIFLDRSSVEVFVNDGEATLSSRIYPQ 
ADSRQLSLYAAHGDAILTDGTLWMLT" 

BASE COUNT 1199 a 1166 c 1406 g 1513 t 

ORIGIN 

1 aaacgttttg gttgatgttc gaaacgtttc ggatcaacag taagacatac ctgaaagcgg 
61 agatgtctta tgatttcaaa gtactgcaga ctgagcagtc ctcgctctga tttaattatt 
121 aaaacccatc cgcatgcaga aattatctgg tggggctctg cactgaaaca tttctcaccg 
181 gatgactgtg ccagcctgga aagaccagtt gcgaatggtc gtctggatat tgatacgcca 
241 ctgacactga tcgctgaaaa tgcccttgga ctttttagtt ctccgggact ggaaggacac 
301 agaaatggac tggatgcatc tcctgttttt tatacagttg acgtggaaca taccgaaaac 
361 accctgagac ttaccagtga agattcggta gccggcctgc gtctggtcag cgagctggtg 
421 atgacgccat cggggattct gaaagttcgt catgcactga ccaacctcag agagggagac 
481 tggcagataa atcgtttcgc aatcacttta cctgtagctg aacgtgcgga agaagtcatg 
541 gcctttcacg gacgctggac tcgtgaattt cagccgcaca gggtacgtct tactcatgat 
601 gcttttgttc tggaaaatcg cagagggcgg acatctcatg agcattttcc ggcgctgatt 
661 gtcggcacac caggcttctc ggaacaacag ggagaggtgt gggctgtgca tctggggtgg 
721 agtggaaatc accgcatgag atgtgaggca aaaactgatg gcaggcgtta cgtacaggct 
781 gaggctctgt ggatgccggg tgagaaggct ctcaggaaga atgaaaccct gtacaccccg 
841 tggctatatg cctgccactc tgcggatggc ctgaatggaa tgagtcagca ataccatcgt 
901 tttttgcgtg atgaaattat ccgtttccct gagcaaaaac tccgccccgt acatctcaat 
961 acctgggaag gtatttattt caatcacaat cctgattaca tcatgcagat ggctgagcgt 
1021 gcagcagcac tgggcgttga acgtttcatt attgatgatg gctggtttaa aggacgtaac 
1081 gatgaccgcg cggctctggg cgactggtat accgatgaac agaaataccc gaacgggctg 
1141 atgccggtta ttaatcatgt gaaatctctc ggtatggagt ttggtatatg ggttgaacca 
1201 gaaatgatta atccggattc tgacctgttt cgtcttcatc cggactggat attgtcaatg 
1261 ccaggatatt ctcagccaac cggaagatat cagtatgttc ttaacctgaa tattccggag 
1321 gcctttgatt atatttataa acgtttctta tggttactgg gagaacatcc ggttgattat 
1381 gtgaaatggg acatgaatcg tgagcttgta caggcagggc atgaaggccg tgcggcagca 
1441 gatgcacaga cccgtcagtt ctatcgattg cttgatctcc tccgtgaacg ttttccacat 
1501 gttgagtttg agtcctgtgc ttccggtggg gggcgtattg acttcgaagt cctgaaacgc 
1561 acacaccggt tctgggcatc tgacaataat gatgccctgg agcgctgcac catacaacgt 
1621 ggcatgagtt actttttccc tcctgaggtg atgggggcac atattggcca tcgccgctgc 
1681 catgcaactt tccggcagca cagcatcgct tttcgtgggc tgacggcatt gttcggccat 
1741 atggggctgg agctggatcc ggtagccgca gatgcgaagg aatctgatgg ttatcgccgg 
1801 tatgccttgc tctataaaga atggcgacaa ctgattcata caggtgttct ctggcgtgtg 
1861 gatatgccgg attcttcgat acaggttcag ggagtcgtca gccctgatca gtctcaggct 
1921 ctttttatga tcagccagct tgcaatgccg gattacacct taccaggcat acttcgtttt 
1981 cccggactgg cggcagaagt gcgttaccgg cttcgggtta ttgatcaccc ggaaattcaa 
2041 ctggttggtg aaggtggtca taccatgcgc agattaccag cctggatgaa ccagcccctc 
2101 gaggccagtg gtgaatggct ggcaaaggga gggattcagc tccccgtact ggatcctgag 
2161 agcgcgattt tgatagcact tgaaagagct gtgtgaagag agatacggag cctcagctcc 
2221 gtcctgtttc tcaatattat cccgtgaggg taagaataat gaattctgca agtacgcata 
2281 aaaatactga tttctggatt ttcggtctgt tcttttttct ctactttttc atcatggcaa 
2341 cctgttttcc gtttttgcct gtatggctgt ccgatgtggt cggactgagt aaaacggata 
2401 caggtatagt cttttcatgt ctgtctctgt ttgccatcag tttccagcca ttgcttggag 
2461 tcatatcaga tcgcctggga ctgaaaaaaa acctgatctg gagtatcagc ctgttactgg 
2521 tatttttcgc cccctttttt ttatatgtat ttgcgcccct gttgcacctc aatatctggg 
2581 caggtgcact gactggcggt gtctttattg gttttgtttt ttctgcaggt gccggagcta 
2641 ttgaagctta tatagagcgg gtcagtcgca gtagtggatt tgaatacggt aaagcgagga 
2761

2821 
3001
4201
4381

4441 

4501 

4561 

4621 

4681 

4741 
4921

4981 

5041 
5161

5221 

5281 



// 



tgttcgggtg 

atccttctct 

atctggcgcg 

cttccctgat 

ttgttctata 

tattttttcg 

ccactgcggg 

gcattggtgc 

gttctgcttt 

aagttccatt 

tgtcagcgac 

tctctacctt 

tcgggatgat 

ggatcgtaca 

acagcgtctt 

ctggtacccg 

gatttatttc 

ggggccaatg 

tgtcgcgctg 

ggatgatgat 

taatgacagc 

tgagaagcag 

agtctggcac 

tgggcaagtt 

tgctcattcc 

taattttcac 

taaccttttt 

gaaaggggta 

ggctgaggac 

tacccgtagt 

tggccggctg 

attatcccct 

attactggtg 

agaagggctg 

ccctgaggaa 

tcttcgtatt 

cttatcaagc 

tggcgatgcg 

agtatttttt 

tgaaaataat 

tgatttattg 

gcactggtat 

gttcaatatt 

cgac 



tctgggttgg 
ggttttctgg 
ccccagtacc 
ttctaccaga 
tacgataggt 
ttcatttttt 
ggagatttgt 
caaaaatacc 
tgccaccacc 
tttgctggtt 
tgtttattta 
tgccggccac 
tgttctgact 
cccttctgtt 
tctttggcac 
atttttcatc 
aatggacgtt 
cactgggggc 
gcacccggag 
ggcgtgctat 
attatccgtg 
ggctgtgtgc 
gaagacggct 
ctgttgtatc 
gcagccggag 
tggctgatgt 
cagagcggtg 
tttgaagagc 
ggcaggcgta 
gaagcctggg 
tgccagcgac 
gtaaggttac 
acgtggcata 
cggttctatg 
gggctggatg 
ttcctggatc 
cgtatttatc 
atattaactg 
cctgacatta 
ggagaatgaa 
ccggagtcat 
ccggggggat 
gcggcgcatt 



gcgttatgtg 
atggggtcag 
agccagacgg 
atggtcttca 
gtggcctgtg 
gacactccgc 
aatgccatta 
ctgcttgttg 
atgacagaag 
ggggcattca 
attggttttc 
ctgtatgatc 
gttaccgtta 
gaaaaagccc 
agtctgccct 
tggctccacc 
accatgcgtt 
atgccaccag 
ataaatatga 
cacttattta 
aagtacagtg 
tgacacctcc 
cctggtggat 
gcgggacatc 
aaagttatat 
tctcaccaca 
ttttggcagg 
tggattatgg 
tcattatggc 
caggatgtct 
ctgtgcgtga 
atggtgtgca 
cggttgacag 
tggataatca 
gttaccgcag 
gctcatctgt 
cgcaagcgga 
atggtacttt 
tctgtcacta 
tgcgtaaagt 
ctggtaagtt 
gcaagtgaat 
ggctacaaca 



caactatggc 

gaggcgcatt 

caatggttat 

gcctgtttcg 

tctatgatgt 

aggcaggaat 

tcatgttctg 

cggggggaat 

tggtgattct 

aatatattac 

agttttccaa 

gtatgggatt 

tatcagcttt 

ctgtagcaca 

tgaaaaactt 

tgccgggtgg 

cttccagcat 

taccgacatg 

tcgtgatggc 

taccggccat 

tctggctacc 

cgaaggtatt 

ggtcattggt 

tttgcgggac 

gtgggaatgc 

ggggatgaac 

gaactggaag 

tcatgacttt 

atggatgaat 

gacgctgccc 

agttgaatct 

attactgacc 

tcacgcggag 

ggccgggcga 

tgttgaactt 

tgaagtgttt 

ctcgagacaa 

atggatgctg 

ctttgtcagt 

aaaagtatgg 

attgccttgt 

aaacaggcac 

gcaaaaggag 



cggaatactt 
attgttgctg 
gaatgcactg 
tatgcgtcag 
atttgatcag 
aaaggcattc 
tacaccatgg 
tatgactatc 
gaaaatgctt 
gggcgtgttt 
acaacttgct 
ccagaatacg 
cacgctgagt 
ttcggagatt 
agcgcacgtc 
atgaatgatc 
catccagcaa 
ttgcactggc 
tgtttttcag 
atttgtctcg 
agtcatgatg 
atgcattttc 
gcccgggacg 
tggcatctgg 
cccgatttct 
ccttccggtt 
cctggttctg 
tatgccccac 
atgtgggatt 
agagaagttt 
ctgcgcagaa 
gaaaatgtac 
cactatggca 
ctgatactgt 
cctgatacag 
gttaacgatg 
ttatcgttat 
acctgagaca 
atatagtgtc 
actcctgatg 
ccggggcggg 
tggtagcagc 
cgatgacagc 



tttaatgtcg 

cttctgttgt 

ggtgccaatt 

atgtggatgt 

cagtttgcca 

ggatttgcca 

ataattaatc 

cgcattaccg 

cacgctcttg 

gacacccgac 

gcaatacttc 

tattttgtgc 

tcttcccctg 

aactgatgaa 

gcggtaatac 

caaatggcct 

gcgcatatca 

aacacgagct 

ggagtgccgt 

aagatcgtgg 

gtattcgctt 

gtgatcccaa 

cttctgacaa 

agcatgttct 

tcaggtgtgg 

accggttccg 

tctttgcgct 

agtccatgct 

cacccgtgcc 

ttgagcgcga 

aatgccagcc 

aggccgcaga 

tccgtctcgg 

ggcgttatta 

aatatctgac 

gtgaggcaac 

atgccgctca 

tatcgatatc 

atacgatgtg 

atacggttgt 

ggggctggca 

tattgcacag 

actacctcgt 